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GENERIC:  A  Programming  Language  for  VLSI  Layout 
and  Layout  Manipulation 

Jon  A.  Solworth 
Research  advisor:  Professor  Ralph  Grishman 

Abstract 

We  describe  a  programming  language,  GENERIC  [GENERation  of 
/ntegrated  Circuits)  for  producing  high-quality,  general-purpose  layout  of 
cxistom  integrated  circuits.  Unlike  other  VLSI  programming  languages,  in 
GENERIC,  existing  layouts  can  be  manipulated  by  the  VLSI  operators  to 
produce  new  layouts. 

The  design  of  a  layout  in  GENERIC  starts  with  a  circuit  description 
which  contains  the  active  components  and  electrical  nets.  The  circuit 
description  (sometimes  called  an  abstract  layout)  is  then  transformed  into  a 
realizable  layout  by  the  application  of  VLSI  operators.  These  operators  are 
both  design-rule  safe  and  wire  connectivity  maintaining.  Built-in  operations 
include  relative  placement,  primitive  compaction,  and  orientation.  A  novel 
mechanism  called  planes  is  described,  which  for  the  first  time  enables  non- 
design  rule  violating  topological  manipulations. 
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GENERIC  forms  the  kernel  of  a  VLSI  design  system.  We  also  describe 
the  cell  library,  Flexcell  which  contains  parameterized  and  modifiable  cells. 
Cells  in  the  Flexcell  library  are  created  using  cell  generators,  but  unlike 
traditional  cell  generators,  the  layout  generated  need  not  exhibit  a  high 
degree  of  regularity.  For  each  cell,  a  number  of  templates  are  provided, 
which  encode  known  good  layout  schemes.  Cells  created  with  a  template 
can  then  be  modified  using  utilities  written  in  GENERIC.  Hence,  Flexcell 
provides  highly  optimized  cells  which  can  be  reused  in  msmy  different 
environments. 
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CHAPTER    1 


Introduction 


1.    Introduction 

Integrated  circuit  design  is  a  complex  task.  The  complexity  stems  from 
the  range  of  required  expertise  (architecture,  circuit  design,  layout,  testing), 
the  interplay  of  on-chip  performsmce  factors  (power,  area,  transistor  sizing, 
capacitive  delays),  and  the  size  of  the  problem  (hundreds  of  thousands  of 
transistors)  coupled  with  the  combinatorial  explosion  involved  in  any  brute 
force  attempt  to  solve  the  problem.  This  complexity  has  lead  to  a  crisis  in 
the  design  of  integrated  circuits.  We  will  take  a  look  at  each  one  of  these 
factors'. 

1.1.    Fabrication  technology 

The  complexity  crisis  in  integrated  circuit  design  is  due  to  the  phenome- 
nal increase  of  fabrication  capability.  From  1959,  when  the  first  planar 
transistor  was  produced  until  1986,  the  number  of  transistors  that  can  fit  on 
a  chip  hsis  increased  almost  six  orders  of  magnitude.  Today's  leading  edge 
logic  chips  contain  about  200,000  transistors,  while  state-of-the-art  dynamic 
RAMs  contain  one  megabit  or  about  one  million  transistors.  Gordon  Moore 
has  made  an  observation   known  as  Moore's  Law:  the  number  of  transistors 
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that  can  be  fabricated  on  a  chip  doubles  every  two  years.  This  prediction 
has  proven  accxirate  for  over  two  decades.  Although  there  are  physical  limi- 
tations which  prevent  Moore's  Law  from  continuing  indefinitely,  it  seems 
probable  that  there  are  at  least  two  more  decimal  orders  of  magnitude 
increase  in  transistor  counts,  ultimately  yielding  10  million  transistor  logic 
chips. 

The  advances  in  fabrication  capability  have  been  remarkably  consistent, 
although  the  fabrication  technology  has  changed  over  time.    Technological 
changes  have  occurred  in  every  area  of  fabrication;  in  materials  where  the 
change   was   from  germanium   to  silicon;    in   mask  lithography    where   the 
change  has  been  from  photographically   reduced  drawings  to  direct  electron- 
beam  writing;  and  in  circuit  families,  where  the  change  has  been  from  bipo- 
lar to  pMOS  to  nMOS  now  to  CMOS.    These  changes  are  characterized   by 
first  the  introduction  of  new  technology,  followed  by  several  years  of  honing 
the  technology   until  improvement  slows  and  the  limits  of  that  technology 
are  reached.    At  this  point  a  new  technology  must  be  introduced,  causing  a 
discontinuity  of  technique.    Until  the  late  seventies,  all  of  the  discontinuities 
were  physical  with  the  primary  one  being  lithography.    Today,  the  critical 
bottleneck   appears   to  be  in  design   automation.     The   rest  of  this   chapter 
examines  why  DA  is  the  dominant  problem,  and  what  tools  must  do  in  order 
to  address  it. 


1^.    Historical  influences 

1.2.1.    History  of  Integrated  Circuit  Design  Automation 

As  in  fabrication  technology,  design  technology  has  undergone  a  series 
of  shifts.  The  design  automation  technology  eras  match  the  scale  of  integra- 
tion. From  small  scale  integration,  starting  at  1  gate,  to  ultra  large  scale 
integration  (over  a  1,000,000  gates),  each  new  era  represents  at  least  an 
order  of  magnitude  increase  in  complexity.  Although  no  agreed  upon 
definition  for  VLSI  and  ULSI  exists,  we  define  the  ersis  are  as  follows:  small 
scale  is  1  to  12;  medium  scale  is  12  to  100;  large  scale  is  101  to  10,000;  very 
large  scale  is  10,001  to  1,000,000;  ultra  large  scale  is  over  1,000,000.  Each 
new  era  requires  qualitatively  different  tools  to  attack  the  level  of  complex- 
ity. 

1.2.1.1.    Small  Scale  Integration  (SSI) 

In  the  beginning,  there  were  few  transistors  on  a  chip,  and  the  indivi- 
dual transistors  were  larger  than  in  today's  chips.  The  primary  problem 
was  to  design  circuits  to  implement  simple  logic  functions,  euid  the  models 
for  circuit  behavior  were  not  reliably  understood.  Engineers  would  design 
test  chips,  fabricate  them,  and  physically  probe  them  during  testing.  The 
layout  design  was  relatively  simple,  and  was  produced  manually  by  cutting 
rubylith.  a  dark  plastic.  The  rubylith  was  then  photoreduced  to  the  size  of 
the   fabrication    chips,    to   create    a    mask   for    fabrication.     Computer-aided 
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design  was  not  needed  and  was  not  used. 

1.2.1.2.  Medium  Scale  Integration  (MSI) 

At  some  point  the  rubylith  got  too  big  to  cut  by  hand.  Computers  were 
first  introduced  in  integrated  circuit  design  to  digitize  existing  layouts, 
which  would  then  be  either  plotted  and  photoreduced  or  sent  to  an  electron- 
beam  maskmaker  for  the  creation  of  masks.  Computers  were  not  used  by 
the  design  engineer,  only  by  the  layout  technician,  and  then  only  as  a  draft- 
ing aid. 

1.2.1.3.  Large  Scale  Integration  (LSI) 

By  the  time  LSI  started  appearing,  the  brassboarding  of  chips  by 
engineers  began  to  break  down.  Assuming  pins  and  transistor  sizes  scale 
proportionately,  the  number  of  pins  per  package  grows  linearly  while  the 
number  of  transistors  per  package  grows  as  the  square  of  the  scaling  factor. 
Hence,  there  is  less  observability  at  the  pins  for  testing  and  debugging  pur- 
poses. Moreover,  the  finer  geometries  in  LSI  chips  made  them  harder  to 
physically  probe,  while  also  making  the  number  of  possible  nodes  to  probe 
extremely  large.  Of  course  with  larger  chips,  the  number  of  bugs  per  chip 
increases.  Hence,  both  the  time  taken  to  find  a  bug  and  the  number  of  bugs 
were  growing  larger. 


The  prime  design  advances  of  this  period  were  in  simulation  tools. 
Spice  [VladimirescuSO]  was  used  extensively  to  ensure  that  the  circuit 
implemented  the  logic  function,  and  to  tune  the  performance  of  circuits. 

Another  increasingly  important  type  of  tool  checked  the  consistency 
between  different  views  of  the  design.  A  view  is  an  abstraction  that  allows 
a  designer  to  limit  his  attention  to  one  particular  domain;  examples  of  views 
include  system  design,  logic  design,  circmt  design,  topological  design  and 
geometric  design.  The  two  most  important  and  automated  views  were  cir- 
cuit design  and  layout.  Net  list  comparison  tools  were  required  to  check 
consistency  between  the  physical  layout  which  was  performed  by  technicians 
and  the  circuit  which  was  designed  by  engineers.  This  reduced  the  time 
between  design  iterations  while  increasing  the  effectiveness  of  each  itera- 
tion. 

A  third  class  of  tools  were  the  design  rule  verifiers  which  checked  for 
"legal"  designs.  The  most  prominent  of  these  was  the  geometric  design  rule 
checkers.  If  a  chip  violates  GDRs,  this  does  not  mean  that  the  chip  will  fail 
to  work,  only  that  it  is  less  likely  to.  Even  if  the  percentage  of  design  rule 
violations  committed  by  technicians  stayed  constant,  fabrication  yields 
would  decrease. 

The  primary  role  of  design  automation  during  this  period  was  to  vali- 
date and  to  reduce  the  number  of  design  iterations,   rather  than  to  create 


designs. 

1.2.1.4.    Very  Large  Scale  Integration  (VLSI) 

When  VLSI  started  to  appear,  designs  were  becoming  too  complex  to 
keep  on  paper  and  the  design  process  began  to  move  onto  the  computer. 
With  hundreds   of  thousands   of  transistors  per  chip,   it  became   extremely 
costly  and  error  prone  to  maintain  multiple  different  views  (ex.  layout,  cir- 
cuit and  logic)  manually.    During  this  phase  the  layout  database  became  the 
main  view,  and  the  logic  view  was  provided  by  running  a  circuit  extractor 
on  the  circuit.    Circuit  extractors,  digital  simulators,  and  layout  and  circuit 
design  editors  are  the  tools  of  this  era.    Also,  some  parts  of  the  design  are 
created  automatically.    Rudimentary  cell  generators  started  to  perform  some 
tasks  such  as  PLA  generation,  data  p^th  generation,  and  creation  of  other 
very  regular  layouts.    Routing  tools  (especially  global  routing)  become  more 
important  because  of  both  the  number  of  nets  to  be  connected  and  the  limi- 
tations of  graphics  editors  in  dealing  with  chip-wide  routing.    Design  moves 
away  from  multiple  forms  and  begins  to  derive  from  one  common  or  data 
base  form. 

This  era  is  characterized  by  the  introduction  of  tools  to  implement  and 
improve  designs  automatically. 


1.2.1.5.    Ultra  Large  Scale  Integration  (ULSI) 

The  ULSI  era  is  silicon  compilers,  and  is  the  era  we  are  just  now  enter- 
ing. Extensive  use  is  made  of  automatic  generation  of  large  parts  of  the 
design,  as  well  as  powerful  tools  to  manipulate  the  design.  Integrating  the 
views  of  the  design  is  also  a  primary  theme.  This  thesis  describes  one  such 
ULSI-era  CAD  tool. 

1.2.2.    History  of  Integrated  Circuit  Parts 

During  the  same  time  that  fabrication  advances  were  forcing  changes  in 
the  design  technology,  similsu-  changes  were  wrought  in  the  end  components 
which  were  being  produced.  Integrated  circuits  were  originally  designed  for 
military  and  aerospace  applications  where  their  low  weight  and  small  size 
were  of  critical  importance.  The  steep  learning  curve  enabled  the  rapid 
technological  transfer  to  the  commercial  marketplace.  The  industry  was 
fabrication  driven  since  product  design  was  trivial:  just  package  on  a  chip 
what  had  been  previously  packaged  on  a  module.  This  approach  to  product 
design  required  a  high  degree  of  commonality  between  the  modules  of 
different  users.  Up  until  about  the  LSI  era,  this  strategy  was  simple  to 
implement. 

As  more  devices  were  integrated  on  chip,  CPUs,  memories  and  eventu- 
ally whole  computers  were  packaged  onto  a  single  substrate.  However. 
these    three    t>-pes    of   chips    are    insufficient    to    exploit    the    performance 


capabilities  available  in  VLSI.  New  architectures,  especially  suited  for  VLSI 
technology,  such  as  systolic  structures  [Kung82]  have  been  devised.  But 
these  architectures  are  not  general  purpose  building  blocks;  instead  they  are 
application  specific  designs.  Even  in  their  most  general  invocation  to  date, 
the  Programmable  Systolic  Computer  [Arnould85],  they  are  at  best  special- 
ized array  processors.  In  order  for  systems  designers  to  utilize  the  inherent 
advantages  of  VLSI  it  is  necessary  for  the  system  builder  to  also  become  a 
VLSI  designer. 

Alternatively,  gate  array  and  standard  cell  layout  methodologies  pro- 
vide relatively  inexpensive  ways  to  quickly  design  circuits.  However,  dev- 
ices produced  using  these  methodologies  aj'e  inherently  slower  than  custom 
chips  (by  a  factor  of  2-4)  [HellerSl]  and  hence  can  never  be  used  for  high 
performance  designs.  This  performance  degradation  is  due  to  compromises 
made  in  the  design  of  these  standard  chips.  The  main  causes  of  slowdown 
are: 

(1)  Transistors  are  designed  for  average,  instead  of  optimal  performance. 

(2)  The  routing  area  is  fixed,  leading  to  either  unusable  transistors  or  more 
area  dedicated  to  routing  than  is  necessary. 

(3)  The  wires  are  longer  than  in  custom,  yielding  slower  speeds  than  cus- 
tom. 
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(4)    Technology  constraints  effectively  prohibit  the  use  of  certain  logic  fami- 
lies such  as  nMOS. 

Not  withstanding  these  problems,  at  least  one  supercomputer  manufac- 
turer, ETA  systems  (a  subsidiary  of  CDC)  plans  to  build  their  computer  out 
of  20k  gate  CMOS  gate  arrays  [Lincoln85].  To  compensate  for  some  of  the 
problems  listed  above,  they  will  supercool  the  chips  to  70°  Kelvin. 

The  constraints  imposed  by  this  design  methodology  not  only  affect  the 
electrical  performance  of  the  chips,  but  also  make  certain  architectural 
choices  (such  as  systolic  structures)  unattractive. 

1.3.   What  are  the  requirements  for  ULSI  CAD  tools? 

The  complexity  of  today's  chips  and  the  inability  to  package  off-the-shelf 
component  types  are  the  driving  force  in  VLSI  and  ULSI  CAD  tools. 

1.3.1.    Expertise 

The  required  expertise  is  increasing  for  VLSI  designers.  Because  the 
capacity  of  VLSI  chips  is  rapidly  approaching  that  needed  to  implement  'sys- 
tems on  a  chip',  integrated  circuit  designers  must  understand  the  applica- 
tions for  which  their  chip  is  intended.  In  addition  to  the  application,  the 
designer  must  understand  architecture  and  especially  VLSI  architecture. 
The  architect  must  then  map  his  solution  into  silicon. 


The  mapping  into  silicon  is  not  as  straightforward  as  the  mapping  into 
TTL  components.    In  TTL  design,  the  designer  must  balance  off  parallelism 
against  component  count.  The  MOS  designer  must  not  only  make  this  tra- 
deoff, but  must  also  trade  off  power  versus  speed,  and  this  tradeoff  is  tight. 
State  of  the  art  microprocessors   are  no  longer  limited  by  the  number  of 
transistors  on  a  chip,  but  by  the  amount  of  power  they  can  dissipate.    For 
example,  the  Hewlett-Packard  9000  processor  has  some  450,000  transistors 
but  this  was  accomplished  only  at  the  cost  of  4  to  7  watts  of  power  dissipa- 
tion.   This  enormous  power  consumption  makes  it  impossible  to  use  the  chip 
with  standard  packaging  technology,  and  special  copper  bound  substrate  had 
to  be   devised    to  wick   away   the   heat.     A   processor   produced    with   less 
advanced  technology,  MIPS  uses  about  2  watts,  which  is  the  upper  bound  for 
the  standard  84-pin  package  in  which  it  is  mounted.    This  is  despite  the  fact 
that  MIPS  contains  only  25,000  transistors!  [HennesseySl]  [HP9000j.    Rules 
of  thumb  (based  on  the  technology)  indicate  that  the  9000  should  be  several 
times  faster  than  MIPS.    Yet  MIPS's  performance  is  equivalent  to  HP90005. 
Although  microprocessor  manufacturers  have  now  switched  to  CMOS  to  deal 
with  power  limitations,  the  samie  issues  come  up  again  here  as  even  more 
design  choices  become  available. 

Power  is  not  the  only  problem.  The  time  taken  to  transmit  a  signal  on 
a  wire  can  no  longer  be  considered  negligible.  Signals  traversing  the  length 
of  the  chip  will  take  as  much  time  as  several  transistor  delays;  Fui'thermore. 
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the  relative  time  to  transmit  a  signal  across  chip  increases  with  scaling.  Of 
course,  this  signal  can  be  made  to  propagate  faster  with  the  use  of  large 
drivers,  but  only  at  the  cost  of  increased  power.  The  way  that  the  circuit 
graph  is  planarized  is  critical  to  the  speed  at  which  the  circuit  runs.  Logic 
design  and  packaging  are  much  more  tightly  interwoven  than  in  traditional 
discrete  design. 

In  addition,  there  is  a  tradeoff  between  the  area  spent  on  wiring  versus 
that  available  for  active  components.  These  factors  dramatically  impact  the 
architecture  especially  in  the  area  of  parallel  versus  sequential  structure.  It 
sometimes  consumes  less  area  (and  is  frequently  faster)  to  recompute  a  func- 
tion than  to  transmit  the  output  from  some  other  part  of  the  chip.  Systolic 
structures  often  take  advantage  of  this  property. 

Packaging  and  partitioning  the  design  have  become  much  more  impor- 
tant issues  because  of  the  severe  performance  penalty  in  going  off  chip,  as 
well  as  the  very  limited  number  of  pins  avsiilable. 

Finally,  because  of  the  high  costs  of  design  errors  (new  masks  and  pro- 
tracted turnaround  time),  there  is  a  much  higher  emphasis  on  design  valida- 
tion. 

This  enormous  interdependency  of  the  phases  of  the  design  dictated 
close  communication  among  the  design  team  in  order  to  achieve  an  high 
quality    integrated    circuit.     The   way   in   which    information    flows   be:ween 
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members  of  the  group  and  is  validated  is  essential  to  the  success  of  the  chip. 

We  are  in  the  midst  of  a  large  number  of  design  discontinuities,  brought 
on  by  both  physical  and  human  issues.  These  discontinuities  invalidate  the 
old  methodology  for  problem  solving  before  new  methodologies  are 
developed. 

1.3.2.    New  Medium 

As  new  media  become  available  to  designers  steeped  in  the  old  design 
methodologies,  the  designers  must  learn  to  use  the  strengths  of  the  new 
medium.  Ivan  Sutherland  presents  the  following  analogy:  When  steel  first 
became  available,  all  the  then  existing  bridge  builders  had  been  trained  to 
build  bridges  using  stone.  The  new  medium,  steel,  was  used  as  a  one-for-one 
replacement  in  building  the  stone  arches  out  of  steel.  But  after  a  time  there 
was  a  flash  of  insight,  the  stone  arches  were  inverted,  and  the  suspension 
bridge  was  invented. 

Integrated  circuit  designers  will  be  forced  to  find  new  structures,  metho- 
dologies and  models  in  order  to  take  advantage  of  the  inherent  power  of 
VLSI.  CAD  methodologies  must  support  existing  design  styles,  allow  new 
design  styles  to  be  discovered,  and  be  able  to  abstract  the  crucial  properties 
from  the  underlying  media. 
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1.3.3.    Combinatorics 

Many  of  the  computational  problems  in  VLSI  design,  and  in  layout  in 
particular  have  been  shown  to  be  NP-hard.  This  includes  such  problems  as 
optimal  PLA  folding,  routing,  and  routing  and  placement  [Luby82,Pinter83]. 
In  addition  to  the  inherent  "intractability"  of  VLSI  sub-problems,  the  size  of 
VLSI  problems  in  terms  of  number  of  edges,  nodes,  etc.  tends  to  be  large,  on 
the  order  of  several  hundred  thousand  elements  or  more. 

These  two  factors  make  it  \inlikely  that  a  general  purpose,  monolithic 
approach  to  automate  the  design  process  will  be  successful.  However,  there 
are  a  number  of  factors  which  ameliorate  this  problem: 

(1)  Specialized  methodologies:  some  easily  computable  algorithms  are  avail- 
able for  specialized  problems  which  occur  often  in  practice.  Examples 
include  PLA  generation,  channel  routing  and  layout  of  some  standard 
components  such  as  adders,  multipliers,  etc. 

(2)  Approximation  algorithms:  Results  obtained  should  be  good  but  need 
not  be  optimal.  Notice  that  the  type  of  problems  which  arise  are  typi- 
cally not  of  the  form  "is  this  set  of  clauses  satisfiable?",  but  rather  "find 
a  solution  of  this  problem  making  the  area  as  small  as  possible."  Exact 
solutions  to  these  problems  are  not  needed  aj\d  hence  can  be  approxi- 
mated. 
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1.3.4.  Computer  Aided  Design 

What  should  computer  aided  design  tools  do  for  designers?  Because  of 
the  inherent  complexity,  it  is  impractical  to  attack  the  problem  by  brute 
force.  And  yet  the  problem  is  too  large,  and  not  enough  is  known  about  the 
solutions  to  apply  heuristics  to  the  design  of  whole  chips.  Instead,  we  need 
strategies  which  keep  the  designer  in  the  loop.  Tools  for  this  purpose 
should: 

(1)  Support  natural  models  of  the  design  abstractions. 

(2)  Provide  smooth  mechanisms  to  span  the  different  design  views 

(3)  Provide  predictive   (as  opposed  to  validation)    feedback  on  timing  and 
other  design  criteria 

(4)  Keep  track  of  details 

(5)  Allow  the  designer  to  use  any  design  style  and  to  produce  high  quality 
designs. 

(6)  Be  efficient. 

1.3.5.  Naturalness 

The  models  used  for  describing  the  design  should  be  natural.  This 
naturalness  criterion  includes  several  aspects  —  the  model  must  match  the 
designer's  understanding,  it  must  provide  mechanisms  to  describe  all  of  the 
structures  and  phenomena  necessary  and  it  should  be  as  simple  as  possible. 
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The  views  provided  should  be  easily  understood  and  give  insight  into  the 
strength  and  weaknesses  of  the  design.  In  essence,  we  would  like  the 
designer  to  be  able  to  "feel"  the  design  in  its  entirety.  Since  the  design  is 
being  developed,  the  model  must  be  dynamic,  allowing  the  designer  to  apply 
operations  to  the  design  to  attain  new  designs.  Moreover,  the  effect  of 
operators  applied  to  the  design  should  have  a  consistent  effect  so  that  it 
doesn't  surprise  the  designer. 

1.3.6.   Spanning  Design  Views 

The  process  of  designing  a  complex  system  requires  the  creation  of 
several  views  or  levels  of  abstraction.  The  purpose  of  these  levels  of  abstrac- 
tion is  to  provide  a  handle  on  the  goals  of  design  at  the  lowest  level.  As  we 
descend  the  design  process,  the  design  database  becomes  increasingly  larger. 
If  the  translation  is  done  by  hand,  then  there  is  a  chance  for  error,  particu- 
larly at  the  lower  levels  of  design.  It  is  most  effective  if  this  translation  can 
be  done  automatically,  especially  since  this  translation  will  have  to  be  done 
multiple  times  during  the  design  process.  However  sufficient  semantic  infor- 
mation must  be  present  so  that  this  translation  can  be  done  unambiguously. 

Direction  of  information  flow  is  important.  Functional  simulation  can 
logically  be  done  before  layout.  However,  with  existing  tools  a  transistor- 
level  schematic  description  must  be  entered  into  the  system,  and  then  simu- 
lated.   The  layout  is  performed  as  a  separate  process,  and  the  two  circuits 
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must  be  compared,  often  by  a  netlist  comparison  program.  There  is  no 
assurance  that  the  two  descriptions  match,  since  one  is  derived  from  artwork 
and  the  other  is  hand  coded.  Thus,  errors  must  be  manually  corrected  until 
the  two  descriptions  are  consistent,  and  this  is  a  tedious  process. 

1^.7.    Predictive  Models 

Another  goal  for  a  methodology  for  producing  chips  is  that  information 
should  be  available  early  in  the  design  process.  We  would  like  to  evaluate 
the  partially  completed  design  without  going  through  the  full  design  cycle. 
For  example,  timing  verification  of  integrated  circuits  is  typically  not  possi- 
ble until  after  layout  is  completed.  Once  the  layout  is  completed,  circuit 
extractors  "reverse  engineer"  the  circuit  finding  the  transistors,  wires  and 
parasitic  capacitors.  This  "reversing  engineering"  is  required  because  of 
inadequate  representation  of  the  design  and  abstraction  transitions  at 
higher  levels  and  because  of  the  sensitivity  of  circuit  design  to  parasitics. 
After  the  extraction,  tools  such  as  Crystal  [Ousterhout83]  or  TV  [JoupiiSS] 
provide  good  static  timing  ansdysis. 

These  criteria  should  instead  be  a  more  integral  part  of  the  design  cycle. 
Early  information,  which  is  predictive,  could  provide,  at  any  state  in  the 
design  process  a  measure  of  the  probable  performance  of  the  circuit  For 
simulation,  estimation  of  parasitics  could  find  probable  critical  paths  which 
would  enable  the  designer  to  focus  on  circuit  and  layout  technique^  to  '•peed 
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up  the  critical  path.  As  the  design  progressed,  the  error  of  this  probabihstic 
measure  would  decrease.  This  would  reduce  the  number  of  cycles  in  which 
chips  are  designed  and  tediously  laid  out,  followed  by  re-designs  and  re- 
layout. 

Earlier  introduction  of  design  information  and  better  dictions  enables 
the  design  of  more  finely  honed  chips  with  a  substantial  decrease  in  design 
cost. 

1.3.8.    Details 

When  designing  a  chip  with  a  million  transistors,  there  are  going  to  be 
a  lot  of  details.  Adequate  naming  and  hierarchical  structuring  are  required. 
Integrated  views  of  the  design  are  needed  so  that  designers  can  flip  back  and 
forth  between  levels  of  the  design.  Screening  mechanisms  are  required  to 
cut  down  on  the  amount  of  textual  or  visual  clutter.  On  a  more  global  level, 
we  need  mechanisms  of  abstraction  which  allow  us  to  form  models,  and  to 
talk  about  things  approximately.  Another  important  aspect  is  that  our 
models  must  accurately  and  simply  describe  the  chip. 

1.3.9.    Design  Methodology 

The  design  system  should  not  constrain  the  design  methodology.  This  is 
particularly  important  when  the  technological  pace  is  rapidly  introducing 
design  discontinuities.    The  old  methodologies  no  longer  work,  while  the  new 


ones  are  in  the  process  of  development.     The  design  system  also  should  be 
usable  by  as  large  a  community  as  possible. 

1.3.10.    Efficiency 

It  is  crucial  that  the  tools  for  VLSI  design  are  "efficient".  The  absence 
of  this  constraint  makes  satisfaction  of  all  other  criteria  trivial;  just  perform 
a  brute  force  search  of  the  design  space.  Our  method  for  keeping  the  design 
process  efficient  consists  of  the  following: 

(1)  Keep    in   the   data   structures   enough    information    so   that   often    (and 
expensively)  computed  values  are  stored,  rather  than  recalculated. 

(2)  Make  the  operators  of  the  language   small  and  simple  enough  so  that 
they  are  not  inherently  inefficient  to  calculate. 

1.4.    What  is  GENERIC? 

GENERIC  is  the  kernel  of  a  system  that  is  intended  to  satisfy  these  cri- 
teria for  layout.    GENERIC  is  a  tool  for  building  tools. 
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CHAPTER    2 


Other  methodologies 


2.   Other  methodologies 

Our  goal  is  to  be  able  to  produce  extremely  high  quality  integrated  cir- 
cuits at  substantially  lower  cost  than  is  possible  today.  These  integrated  cir- 
cmts  are  ones  in  which  performance  is  paramount  such  as  microprocessors 
and  other  computing  components,  or  ones  which  will  be  produced  in  very 
high  volume  such  as  engine  control  circuitry  for  a  major  car  manufacturer. 
Concomitantly,  we  would  like  to  increase  the  quality  of  these  components. 

In  this  chapter  that  part  of  the  design  space  is  explored  which  is  either 
capable  of  producing  high  quality  designs  (albeit  at  high  cost)  or  those  which 
fall  short  but  might  be  extended.  A  tree  representing  the  relevant  part  of 
the  design  space  is  shown  in  figure  2.1.  The  purpose  of  this  exploration  is  to 
understand  why  capable  design  technology  has  high  cost  and  why  lower  cost 
technology  is  not  capable. 
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Figure  2.1  —  Design  space  for  high  quaUty  integrated  circxiit  production. 

The  root  node  of  the  tree  is  the  design  space.  At  the  second  level  is  the 
set  of  design  methodologies,  arranged  in  order  of  increasing  complexity.  The 
methodologies  are  gate  array,  standard  cell,  silicon  compilation,  and  hand- 
crafted custom  design.  They  are  discussed  in  section  2.1  in  terms  of  their 
design  cost,  quality  of  circuits  produced,  and  scalability.  We  shall  use  the 
term  custom  to  indicate  that  the  layout/circuit  design  is  customized  for  the 
design,  and  not  in  the  sense  of  requiring  custom  manufacturing  (i.e.  masks). 
We  conclude  that  gate  array  and  standard  cell  are  inherently  incapable  of 
meeting  our  design  requirements.  The  choice  is  therefore  between  hand- 
honed  custom  (at  high  cost)  or  silicon  compilation  'at  lower  quality). 
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In  section  2.2,  we  examine  the  state  of  the  art  in  silicon  compilation. 
The  focus  is  on  the  set  of  tools  for  automatic  layout  generation.  These 
include  compaction,  cell  generation,  and  place  and  route. 

Since  silicon  compilation  is  currently  insxifficient  to  produced  the 
desired  quality  of  circuits,  we  examine  both  the  tools  of  and  the  capabilities 
in  hand-honed  custom  design  to  understand  what  is  required  for  the  task. 
The  tools  consists  of  a  whole  suite  of  programming  languages  and  graphics 
editor  creation  tools  and  of  validation  tools.  The  tools  are  discussed  in  sec- 
tion 2.3.  The  capabilities  are  the  layout  styles  that  a  designer  will  employ; 
clearly,  the  larger  this  set  is  the  more  flexibility  and  likelihood  to  produce 
high  quality  designs.    The  layout  styles  are  described  in  section  2.4. 

2.1.   Design  methodologies 

In  this  section  we  will  briefly  review  the  advantages  and  limitations  of 
existing  design  methodologies. 

2.1.1.    Gate  Arrays 

Clearly  the  most  popular  of  the  approaches  to  custom  or  semi-custom 
design  is  the  use  of  gate  arrays.  Gate  arrays  are  prefabricated,  unconnected 
transistors  which  can  be  wired  together  into  desired  circuits  when  the  final 
metal  layers  are  patterned  on  the  chip.  Since  the  single  largest  one-time 
cost  in  the  manufacturing   of  integrated   circuits  is  the  cost  of  masks,  gate 


21 


arrays  reduce  the  costs  by  reducing  the  number  of  custom  masks  needed. 
Although  the  performance  of  these  chips  is  lower  than  full  custom,  this  is  an 
attractive  technology  for  small  production  runs. 

The  interface  to  the  designer  in  a  typical  gate  array  CAD  system  is 
much  like  that  using  standard  TTL  parts.  The  user  specifies  a  TTL  design 
which  can  either  be  simulated  or  breadboarded.  The  design  can  then  be  the 
mapped  by  the  software,  using  libraries  of  implementations  for  TTL  parts, 
onto  the  gate  su-ray  and  wired  automatically.  Since  the  TTL  library  struc- 
tures are  designed  by  hand,  the  CAD  problem  reduces  to  placement  and 
routing. 

For  low  volume,  low  performance  designs,  the  simplicity  of  the  tech- 
nique makes  it  the  technology  of  choice.  For  high  volume  parts,  the  smaller 
die  size  makes  custom  more  attractive.  The  question  which  remains  is:  what 
is  the  best  way  to  build  high  speed,  low  volume  parts?  Techniques  such  as 
Multi-Project  Wafers  drastically  reduce  the  cost  of  mask  making.  Therefore. 
the  only  reason  not  to  use  full  custom  techniques  for  these  parts  is  if  the 
CAD  technology  cannot  bridge  the  design  cost  differential  between  custom 
and  non-custom  techniques. 

2.1.2.    Standard  Cells 

To  increase  the  performance  of  designs  over  that  which  is  attainable 
with  gate  arrays,  some  vendors  and  systems  houses  have  turned  to  itandard 
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cell   design.     This  technique   uses  hand   designed    standard   cells   (requiring 
custom  fabrication)  which  are  then  placed  and  routed. 

Each  cell  is  more  highly  optimized  than  in  the  gate  array  case,  and 
monolithic  regular  structures  such  as  RAMs,  ROMs,  and  PLAs  can  be  gen- 
erated on  the  chip.  Furthermore,  the  size  of  wiring  channels  are  not  fixed  in 
advance,  and  so  less  area  is  wasted  for  wiring  and  there  are  no  unwirable 
transistors.  Like  gate  arrays,  it  is  a  bottom-up  design  strategy  and  so  at 
high  levels  of  integration  it  approaches  the  inefficiencies  of  gate  arrays  since 
the  cells  are  not  adequately  composible.  For  exsunple,  one  manufacturer 
must  use  over  70%  of  the  area  for  wiring  channels.  This  could  be  reduced  by 
building  more  types  of  large  structures,  but  then  we  have  come  full  cycle: 
what  structures  should  be  built? 

It  is  clear  that  neither  gate  arrays  nor  standard  cells  solve  the  problem 
of  how  to  create  custom  integrated  circuits:  they  only  pjostpone  it. 

2.1.3.    Silicon  compilers 

The  most  appealing  of  all  approaches  is  the  silicon  compiler,  many  of 
which  have  been  described  in  the  literature.  Ideally,  a  silicon  compiler 
takes  as  input  a  concise  behavioral  description  of  the  function  to  be  imple- 
mented £Lnd  produces  a  high  quality,  custom  integrated  circuit. 

If  silicon  compilers  produced  circuits  comparable  to  hand  crafted,  there 
would  be  no  need  for  hand  crafted  design.s.    However,  state  of  the  art  silicon 
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compilers  are  no  match  for  the  custom  integrated  circuit  designer.  To 
reduce  complexity  and  increase  quality  of  design,  today's  silicon  compilers 
have  specialized  input  specification  languages  and/or  target  architectures. 
For  example,  Bristle  Blocks  [Joh£inssen79]  (the  first  silicon  compiler)  used  a 
standard  cell  style  layout  system.  Another  compiler,  PLEX  [Buric83]  pro- 
duces a  parameterized  microprocessor  to  be  packaged  on-chip  with  other 
logic  components.  PLEX's  parameterization  is  very  simple,  essentially 
changing  the  width  of  the  datapath  or  modifying  the  structure  of  the  control 
store.  Other  silicon  compilers,  for  example  F.I.R.S.T.  [BergmainnSS],  are  tar- 
geted towards  a  particular  problem  domain,  in  this  case  signal  processing. 

The  range  of  automatic  layout  techniques  for  silicon  compilers  is 
extremely  narrow,  and  is  limited  to  place  smd  route,  compaction,  and  cell 
generation.  It  is  difficult  to  measure  the  quantitative  effective  loss  of 
automatic  techniques,  since  there  are  no  standard  benchmarks.  However, 
none  of  the  above  mentioned  three  techniques  are  capable  of  producing  area 
efficient  cells  for  massive  replication.  Another  problematic  area  for  silicon 
compilers  is  the  production  of  systolic  designs,  with  their  tight  interleaving 
of  memory,  data  and  control.  Hence,  more  general  purpose  layout  tools  are 
needed  to  meet  these  challenges. 
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2.1.3.1.    Compaction  based  silicon  compilers 

One  compaction  based  silicon  compiler,  Dumbo  [Wolf83]  produces  layout 
from  circuit  schematics.  Using  force-directed  placement,  followed  by  routing 
and  compaction,  this  compiler  was  able  to  achieve  layouts  averaging  2.2 
times  the  size  of  those  produced  by  hand.  The  hand  produced  cells  are  from 
the  Stanford  Cell  Library  [NewkirkSS],  and  vary  from  6  to  24  transistors. 

Although  experiments  were  not  run  on  larger  cells,  the  penalty  is  likely 
to  increase  as  sizes  scale  up.  This  enormous  increase  in  size  is  mainly  due 
to  the  automatic  placement  algorithms.  If  layout  begins  from  (manual) 
placement,  the  penalty  is  1.6  times  that  of  hand  layout.  These  increases  are 
for  area;  because  wiring  delays  (RC)  grow  quadratically,  and  assuming 
manhattan  routing,  the  wire  delay  penalty  will  be  four  times  the  area 
penalty.  (This  is  because  wire  length  from  one  corner  of  a  square  to  another 
would  be  2*^ area ,  which  would  then  be  squared  to  get  wire  delay.) 

It  is  likely  that  there  are  better  initial  placement  techniques  than  the 
one  used  in  Dumbo.  One  based  on  graph  planarity  is  described  in  a  later 
section;  this  is  an  important  area  for  research. 

2.1.3.2.    Cell  generation  silicon  compilers 

Compaction  techniques  use  graph  algorithms  to  embed  circuits  into  a 
topology.  An  alternative  approach,  cell  generation  uses  known  "good"  topo- 
logies  for  producing   a  family   of  layouts.     The  result   is  designs   which   are 
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tuned  simultaneously  for  both  layout  and  circuitry.  An  example  is  a  fast 
adder  with  different  sized  drivers  in  the  summing  tree  to  balance  against 
different  length  wires.  When  this  technique  is  applicable,  it  should  be  more 
effective  than  pure  compaction  based  techniques. 

A  typical  generation-style  silicon  compiler  is  Macpitts  [Siskind82].  The 
Macpitts  compiler,  designed  for  signal  processing,  produces  a  floorplan  com- 
posed of  two  sections:  a  data  path  aoid  a  control  path.  The  data  path  uses  a 
datapath  generator  to  produce  the  arithmetic  functions.  The  control,  instead 
of  using  the  more  traditional  PLAs,  uses  a  Weinberger  Array  [Wein- 
bergers?]. Cell  generation  by  Weinberger  Arrays  uses  only  nor-gates,  laid 
out  in  a  standard  strips  type  format. 

Once  again  it  is  difficult  to  measure  the  performance  of  the  MacPitts 
methodology.  A  later  work  [Fox83]  benchmarks  the  MacPitts  compiler 
against  existing  designs  for  the  applications  area  in  which  MacPitts  was 
intended.  Macpitt's  base  cells  were  redesigned  and  the  method  for  creating 
the  control  section  was  modified;  performance  was  increased,  at  least  for 
some  chips  by  one  order  of  magnitude.  Area  was  also  decreased  by  replace- 
ment of  the  Weinberger  Array  with  a  standard  cell  layout,  and  through 
modifications  in  the  datapath  design.  However,  even  after  extensive 
modifications,  the  compiler  produced  layouts  ion  the  two  examples'  which 
were  5  to  8  times  larger  than  tho~e  produced  by  standard  cell  desigr.     Even 
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worse,  the  larger  penalty  was  on  the  larger  cell,  which  could  indicate  that 
the  area  penalty  is  superlinear. 

2.1.3.3.    Silicon  compilation  vs.  compilation 

It  is  worth  comparing  silicon  compilation  against  the  more  traditional 

software  compiler.    The  most  important  determinate  in  the  efficiency  of  the 

code  produced  by  compilers  is  the  semantic  match  between  the  programming 

language  and  the  execution  model.   This  can  be  further  broken  down  into: 

primitive  match.    Are  the  statements  of  the  programming  language  easily 
mapped  into  the  primitives  of  the  execution  model,  and 

semantic  gap.  Are  programming  language  statements  mappable  into  a 
succinct  list  of  instructions. 

The  match  of  primitives  requires  a  natural  correspondence  between  the 
language  and  the  underlying  execution  model.  For  example,  almost  all  com- 
puters perform  arithmetic  most  efficiently  on  some  fixed  number  of  bits. 
Languages  in  which  every  number  is  likely  to  be  a  different  size  are  not 
likely  to  execute  efficiently  on  such  a  machine  and  such  arbitrary  precision 
arithmetic  is  generally  not  supported  by  standard  programming  languages. 
For  silicon  compilation  the  notions  of  precision,  parallelism  and  pipelining 
must  be  adequately  supported.  While  important,  this  issue  is  beyond  the 
scope  of  this  thesis. 

To  adequately   span  the  semantic  gap  of  language   to  execution   model, 
the    phases    of  the    compilation    procesi    must    be    understood.      In    software 
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compilers  the  phases  are  typically:  lexical  analysis,  parsing,  optimization 
and  code  generation.  For  silicon  compilers,  code  generation  must  be  further 
divided  into  logic,  circuit  and  layout  design.  These  three  phases  are  both 
very  large  and  not  well  understood. 

Logic  design  is  the  study  of  time-space  tradeoffs  and  is  beyond  the  scope 
of  this  thesis.  Circuit  and  layout  design  are  usually  done  in  the  same  phase, 
using  one  or  a  hybrid  of  three  techniques:  compaction,  place  and  route  or  cell 
generation.    These  techniques  are  discxissed  in  section  2.2. 

In  cases  where  either  the  primitive  match  is  low  (such  as  Fortran  com- 
pilers for  parallel  machines)  or  where  the  semantic  gap  is  high,  extensive 
use  of  optimization  techniques  must  be  used.  These  systems  modify  the 
object  code  by  correctness  preserving  transformations. 

We  believe  that  the  distance  in  abstraction  level  between  functional 
description  of  the  chip  and  the  target  layout  is  as  large  as  that  between  very 
high  level  languages  and  traditional  architectures.  As  a  result,  production 
quality  silicon  compilers  must  be  capable  of  performing  aggressive  optimiza- 
tions. No  silicon  compiler  we  know  of  transforms  layouts  into  other  layouts; 
we  believe  this  to  be  a  critical  component  of  high  quality  silicon  compilers. 

2.1.4.    Hand  crafted  custom  designs 

When  large  volume,  high  performance  parts  are  required,  hand-crafted 
custom   design    is   the   methodology    of  choice.     Chips   can    be   optimized    for 

28 


speed/area/power  tradeoffs,  and  when  combined  with  high  capability 
manufacturing  provide  the  cost  or  performance  advantages  that  manufactur- 
ers require  to  achieve  market  share. 

However,  with  the  exception  of  memory  products,  these  designs  rely  on 
CAD  tools.  Critical  tools  include:  graphics  editors,  simulators,  geometric 
and  electrical  rule  checkers,  net  list  comparitors  and  design  data  bases. 
Moreover,  each  transistor  used  in  the  custom  design  is  not  hand-drawn: 
extensive  use  is  made  of  generators  and  routers.  We  shall  examine  those 
techniques  which  are  available  to  the  custom  designer  for  automatically  gen- 
erating parts  of  the  design  in  section  2.3. 

2.2.    Automatic  layout  generation 

In  a  previous  section,  several  silicon  compilers  were  discussed  along 
with  the  mechanisms  they  employed  for  generating  layout.  In  this  section  a 
more  detailed  examination  of  the  mechanisms  is  given,  and  a  discussion  of 
their  inherent  limitations.  The  mechanisms  of  GENERIC  can  be  best  under- 
stood in  comparison  to  these  mechanisms. 

2.2.1.    Compaction 

Compaction  takes  a  loose,  or  sticks  layout  [WilliamsSO]  and  produces  a 
tight  layout  which  is  area  efficient.    Compaction  is  an  effective  technique  for 

custom  design,  and  several  production  systems  make  e.xtensive  use  of  it.  for 
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example  VIVID  [Rogers83]. 

However,  compaction  requires  a  predetermined  topology.  Of  the  two, 
compaction  and  topological  graph  planarization,  topological  graph  plangiriza- 
tion  is  clearly  the  harder  issue.  If  compaction  is  to  be  used  in  silicon  com- 
pilers, good  topology  determining  methods  must  be  found. 

Compaction  methodologies  tend  to  perform  one  dimensional  compaction 
since  two  dimensional  compaction  is  NP-hard  [Sastry82].  Compaction  based 
schemes  are  of  two  types,  virtual  grid  or  critical  path. 

2.2.1.1.    Compaction  techniques 

2.2.1.1.1.    Virtual  grid  compaction 

In  virtual  grid  compaction,  each  component  is  assigned  to  an  integer  x-y 
coordinate  on  a  virtual  grid.  Components  do  not  move  between  coordinates 
on  the  virtual  grid:  the  compaction  is  performed  by  spacing  the  grid  lines  as 
close  as  possible  together  on  the  real  grid.  In  figure  2.2  (a)  is  an  initial  vir- 
tual grid  layout  for  the  cell.  Virtual  grid  lines  cannot  interchange;  if  line  i 
is  to  the  left  of  line  j,  then  after  compaction  all  objects  on  line  i  will  be  to  the 
left  of  objects  of  line  j.  Since  components  are  unable  to  move  between 
different  grid  lines,  this  technique  is  less  effective  than  the  critical  path 
technique  (to  be  described  below\  but  requires  only  linear  run  time.  When 
compacted,  this  results  in  the  layout  shown  in  figure  2.2  'bi,  where  obvious 
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improvements    can    be    made.     Examples    of  virtual    grid    systems    include 
Mulga  [WesteSl]  and  HCOMPACT    [Entenman83]. 


(a) 


(b) 


Fig\ire  2.2  —  Virtual  grid  compaction. 
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2.2.1.1.2.    Critical  path  compaction 

In  critical  path  algorithms  the  layout  is  viewed  as  a  planar  graph  in 
which  components  are  scheduled  to  locations  to  reduce  the  longest  path.  The 
problem  is  reduced  to  a  set  of  linear  constraints  of  the  form: 

where  x,  is  the  x  coordinate  of  object^  (similarly  for  j),  and  u^  is  the 
minimum  spacing  between  the  two  objects.  This  constrains  objects  to  reside 
in  half  planes  due  to  the  transitivity  of  inequality.  Examples  of  this  tech- 
nique are  Cabbage  [Hseuh69]  or  Lava  [Mathews82].  Since  it  is  only  feasible 
to  perform  one-dimensional  compaction,  compaction  proceeds  alternately 
along  X  and  y  axis.  However,  because  only  one  dimension  is  considered  at  a 
time,  the  problem  is  overconstrained.  For  example,  in  figure  2.3  (a)  there 
are  two  constraints,  only  one  of  which  needs  to  be  satisfied. 

Kedem  and  Wantanabe  [Kedem83]  examined  a  method  to  eliminate  the 
overconstrained  problem  shown  in  2.3  (a).  A  decision  variable  is  assigned  to 
each  pair  of  constraints.  Each  variable  takes  on  the  value  in  {0,1}  indicating 
which  of  the  pair  of  constraints  are  to  be  satisfied  (2.3  (b)  or  (c)),  and  the 
problem  can  then  be  solved  as  in  other  compactors.  Their  backtracking  solu- 
tion, although  worse  case  exponential,  runs  in  reasonable  time. 
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(a)  (b)  (c) 

Figure  2.3  —  Overly  constrained  one  dimensional  compaction. 


2^.1.2.    Compaction  limitations 

Compaction  is  a  coarse  strategy:  it  is  performed  globally  on  the  sticks 
representation  as  an  optimization  problem.  For  example,  compaction  based 
strategies  are  unable  to  perform  even  minor  topological  or  replacement  func- 
tion  to  improve  the  quality  of  the  layout.  Moreover,  because  of  the  way  con- 
straints are  extracted  from  the  layout,  it  is  not  even  possible  to  achieve  all 
non-topological  manipulations.  In  figure  2.4,  a  topological  manipulation 
(possible  in  GENERIC)  is  performed  to  decrease  the  area  or  reduce  the 
capacitance/resistance  along  a  wire. 
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(a)  (b) 

Figure  2.4  —  A  GENERIC  topological  manipulation  used  in  compaction. 

Figure  2.5  (a)  shows  a  circuit  produced  by  Kedem  and  Wantanabe 
[Kedem83].  By  replacing  the  pull-ups  with  source  connected  to  gate  in  2.5 
(a)  with  a  pull-up  and  a  poly-diffusion  contact,  area  can  be  reduced  as  showTi 
in  2.5  (b). 


34 


(a)  compacted  layout  (b)  use  excess  horizontal  space 

to  decrease  compacted  height 

Figure  2.5  —  Replacing  a  primitive  pull-up  with  two  components 

Even  if  optimal  two  dimensional  compaction  were  tractable,  there 
remains  the  very  difficult  problem  of  selecting  a  starting  topology.  Ng  and 
Johnsson  [Ng85]  have  examined  techniques,  based  on  graph  planarity  test- 
ing [Hopcroft74],  to  compute  good  topologies  for  circuits.  The  assumption  is 
that  the  more  planar  a  circuit  is,  the  better  its  layout  is.  However,  this 
metric  is  a  heuristic  which  does  not  take  into  account  the  size  of  objects  nor 
that  the  resulting  circuit  should  be  a  rectangle,  presumably  with  reasonable 
aspect  ratio. 


35 


2.2.2.  Routing 

Routing  takes  as  input  a  set  of  cells  which  are  placed  (but  not  neces- 
sarily connected)  and  a  wiring  list  of  terminals  that  need  to  be  connected, 
and  produces  a  layout  for  those  connections  using  wiring  layers  which 
satisfies  geometric  design  rules.  A  detailed  description  of  routing  is  beyond 
the  scope  of  this  thesis.  Because  routing  technology  is  at  present  incapable 
of  producing  high  quality  cells  (which  require  a  combination  of  place  and 
route),  this  technique  is  only  of  use  for  connecting  together  large  cells, 
where  the  percentage  loss  due  to  this  limitation  is  small. 

2.2.3.  Place  and  route 

Place  and  route  is  similar  to  routing,  but  has  the  added  degree  of  free- 
dom that  the  determination  of  the  cell  location  and  orientation  is  under  the 
control  of  the  algorithm.  Place  and  route  is  the  technology  used  in  gate 
array  or  standard  cell  designs.  As  discussed  previously,  chip-wide  use  of 
these  techniques  is  not  an  effective  use  of  silicon. 

2.2.4.  Cell  generation 

Cell  generation  is  a  function-specific  automatic  technique  for  performing 
place  and  route  at  extremely  fine  grain.  Cell  generation  is  an  effective  tech- 
nique for  generating  high  quality,  customized  circuits.  However,  today's  cell 
generator;  can  only  generate  very  regular  layout  structures  such  as  RAMs. 
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ROMs,  PLAs  and  datapaths. 

Because  the  layouts  produced  by  cell  generators  are  so  regular,  the  lay- 
out can  be  optimized  by  mapping  it  into  a  combinatoric  problem  and  solving 
it  combinatorially.  For  example,  the  area  that  a  PLA  consumes  is  essen- 
tially equal  to-  the  product  of  the  number  of  terms  in  each  term  plane. 
Reducing  this  product  reduces  the  size  of  the  PLA.  Hence,  optimization  pro- 
grams for  PLA  splitting  smd  folding  operate  without  any  detailed  knowledge 
of  layout,  and  hence  quaUty  regular  cell  generation  is  easily  and  effectively 
partitioned.  Moreover,  since  these  combinatorics  are  difficult  to  optimize  by 
inspection,  the  cell  generator  often  performs  better  than  a  human  can. 

Although  these  structures  are  very  effective  when  appropriately  used,  a 
larger  variety  of  structures  are  needed  than  provided  for  in  the  above  set. 
For  exEimple,  if  an  n-bit  exclusive-or  is  generated  using  a  PLA,  the  amount 
of  area  consumed  will  be  exponential  since  the  PLA  is  dependent  on  the  size 
of  the  disjunctive  normal  form.  A  more  appropriate  implementation  as  a 
tree  of  binary  exclusive-ors  takes  only  n  log  n  space. 

2.3.    Custom  layout  tools 

There  are  two  methods  used  for  the  creation  of  custom  layout.  They 
involve  the  use  of  a  graphics  editor  or  a  programming  language. 
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2.3.1.    Procedural  layout 

Early  procedural  layout  languages  such  as  LAP  were  little  more  than 
pre-processors  for  GIF  [HonSO]  and  provided  the  advantages  of  symbolic 
expression  and  better  naming.  However,  these  languages  were  crude,  pro- 
viding the  designer  with  a  difficult  correspondence  problem  between  the 
source  code  and  the  resulting  layout.  With  the  introduction  of  graphics  edi- 
tors such  as  Caesar  [OusterhoutSl],  these  languages  were  quickly  replaced 
despite  the  advantages  of  their  parameterizability  and  computation  capabil- 
ity. Both  LAP  and  Caesar  represented  layouts  only  as  rectangles  and  layers 
—  the  transistors,  contact  cuts  and  wires  composing  the  circuit  had  to  be 
found  by  circuit  extractors  and  geometric  design  rule  checkers. 

A  later  language  is  Sticks  and  Stones  [CardelliSO].  Contrary  to  its 
name,  Sticks  and  Stones  has  nothing  to  do  with  the  VLSI  abstraction  of 
Sticks  Diagrams;  it  linguistic  origins  are  presumably  in  some  (obscure^  Scot- 
tish phrase.  This  language  is  similar  to  DPL  [BataliSO],  and  shares  with  it 
several  attributes.  Both  languages  create  the  layout  as  a  drawing  of  fully 
fleshed  geometry  (with  no  notion  of  design  rules  or  circuit).  The  drawing  is 
composed  of  objects  with  ports,  which  are  places  to  be  wired  to  or  which  can 
match  other  ports  by  abutment.  The  objects  are  hierarchically  structured. 
Sticks  and  Stones  includes  a  number  of  transformations  such  as  move  and 
rotate  which  transform  the  object  while  extending  the  wiring  paths  to  main- 
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tain  connectivity.  However  the  designer  must  be  very  careful  to  prevent  the 
wiring  paths  from  illegally  intersecting  other  geometry.  In  addition,  names 
can  be  propagated  to  different  levels  of  the  hierarchy  by  renaming  opera- 
tions and  by  restriction  operations,  which  remove  names  from  the  object. 
Since  Sticks  and  Stones  is  an  applicative  language,  the  operations  produce 
new  objects  with  the  desired  attributes. 

The  DPL/Daedulus  [BataliSO]  was  produced  at  about  the  same  time  as 
Sticks  and  Stones.  Extensive  use  is  made  of  naming  and  symbolic  variables 
in  order  to  reduce  the  absolute  grid  numbering  of  earlier  systems.  Primitive 
point-to-point  wiring  routines  are  embedded.  However,  it  is  the  responsibil- 
ity of  the  designer  to  ensure  that  objects  are  properly  placed,  and  post  design 
rule  checking  is  required  for  verification.  Moreover,  there  is  not  much  abil- 
ity to  change  existing  designs;  components  are  hierarchically  based,  but 
within  the  hierarchy  they  are  composed  of  rectangles.  Hence,  the  design 
does  not  contain  higher  level  notions  such  as  wires,  cells  or  circuits.  DPL 
was  used  in  large  designs  extensively  with  a  graphics  editor;  it  was  also 
used  to  produce  an  early  datapath  generator. 

Another  class  of  design  languages  are  the  constraint  based  languages  in 
which  relative  location  is  described,  eind  absolute  location  is  determined  by 
some  form  of  compactor.  Examples  of  this  type  of  system  include  ALI  [Lip- 
ton82]  and  HILL  [LengauerS4].     There  are  two  primary  advantages   to  this 
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technique  over  entry  using  sticks-based  graphics  editors.  First,  cells  can  be 
parameterized  for  scaling  (eg.  width  of  Vdd  lines  J  or  for  functionality  leg. 
number  of  bits  in  an  ALU)  by  the  generating  program.  Second,  recursive 
structures  (such  as  H-trees)  can  easily  be  built.  However,  these  programs 
produce  static  structures  which  are  unmodifiable.  Hence,  all  placement 
information  must  be  decided  initially.  It  appears  that  this  technique  will 
not  scale  well  to  non-regular  cells. 

2^.2.   Graphics  Based  Design  Systems 

By  far  the  most  common  interface  for  designing  custom  integrated  cir- 
cuits is  the  graphics  layout  editor.  Graphics  based  systems  have  several 
advantages  over  non-graphical  systems: 

•  They   provide    feedback    on   the  quality   of  the  design   (human   pattern 
recognition) 

•  They   provide    a  mechanism    for  determining   what   changes    should   be 
made  to  the  design  (human  pattern  recognition) 

•  They  provide  a  mechanism  for  changing  the  design  (the  graphics  opera- 
tors) 

Together,  these  three  attributes  enable  the  graphics  editor/designer   to  form 
a  very  effective  interface. 
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However,  there  are  a  number  of  drawbacks  to  this  methodology  which 
increase  in  severity  as  the  level  of  integration  is  scaled.  First,  scaling 
results  in  ever  larger  designs  which  must  be  looked  at  through  a  limited 
window.  Second,  a  graphics  editor  captures  only  the  end  design  —  not  the 
process  of  design.  When  parts  of  the  design  must  be  redone,  or  when  other 
designs  need  high  performance  cells,  these  cells  must  be  redesigned  or  exten- 
sively edited  by  hand.  This  increases  the  design  cycle  time  and  requires 
tedious  re-work.  Third,  different  views  of  the  design  are  hard  to  integrate. 
Finally,  the  structure  of  the  design  must  be  annotated  with  auxiliary  infor- 
mation such  as  names,  design  intent  and  constraints. 

We  shall  discuss  one  graphics  based  design  system.  Magic 
[Ousterhout84],  and  contrast  it  with  GENERIC.  Magic  is  a  second  genera- 
tion design  system,  and  contains  the  changes  characteristic  to  second  gen- 
eration VLSI  design  systems.  It  is  component  based  and  is  more  highly 
integrated  with  the  rest  of  the  design  system.  Circuit  extraction  is  a  much 
cheaper  operation  than  in  earlier  systems  and  design  rule  checking  is  com- 
pletely integrated,  running  as  a  background  process.  In  addition,  the  opera- 
tors are  higher  level,  and  one  of  them,  plowing  [Scott84]  knows  about  design 
rules  and  compaction. 

Plowing  is  a  primitive  compaction  operator.  Unlike  full  compaction, 
plowing  does  not  attempt  to  globally  optimize  the  size  of  the  cell.    Instead, 
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plowing  performs  a  one-dimensional,  single  column  move.  The  order  of  the 
application  of  plowing  and  the  extent  of  plowing  will  determine  the  actual 
cell  layout.  A  plow  is  defined  as  a  vertical  (horizontal)  line  segment,  which 
is  moved  in  some  horizontal  (vertical)  direction.  As  the  plow  moves,  the 
geometry  in  front  of  the  plow  is  compressed.  An  example  is  shown  in  figure 
2.6 


plow 


(a)  before  plow  (b)  after  plow 

Figure  2.6  —  Plowing  compression 
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The  Moce  operator  in  GENERIC  is  similar  to  plow.  Plow  takes  a  fixed 
size  line  segment  and  moves  objects  in  the  path  until  the  plow  has  been 
moved  a  fixed  distance.  Move,  on  the  other  hand,  takes  a  list  of  objects, 
which  need  not  be  contiguoias,  and  moves  them  until  either  the  desired  dis- 
tance is  moved  or  an  object  is  encountered  which  cannot  be  moved.  Of 
course,  given  that  the  operator  is  embedded  in  a  programming  language,  it 
is  easy  to  simulate  plow.  Other  types  of  operators  could  also  be  simulated 
(in  GENERIC),  such  as  the  one  in  figure  2.7  which  moves  obstructions 
orthogonally  to  the  direction  of  plowing. 


^ 

1 

; 

(a)  L-shaped  wire  to  be 
"plowed"  to  the  right 


(b)  contact  moved  down  so 
that  wire  moves  right  without 
relocating  contact  further  to  right 


Figure  2.7  -  GENERIC  meta-operation  alternative  to  plowing. 
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2.4.    Layout  strategies 

The  description  below  describes  layout  methodologies  which  are  in  com- 
mon use  by  VLSI  designers.  GENERIC  is  capable  of  producing  layouts 
using  each  of  these  methodologies. 

2.4.1.  Random  logic 

The  most  basic  (and  first)  design  technique  is  the  so  called  random 
logic  layout.  The  designer  is  able  to  attain  optimal  design  by  freedom  from 
design  methodology  constraints;  any  VLSI  structure  can  be  realized.  This 
technique  is  also  the  most  expensive  to  design  in,  and  the  most  inflexible  to 
modify.  However,  random  logic  is  the  methodology  for  doing  most  low  level 
cells.  Thus  even  systems  which  use  highly  replicated  cells  need  random 
logic  for  the  design  of  the  replicated  cells. 

2.4.2.  Rigid  wire  logic 

A  standard  attempt  to  save  layout  time,  and  increase  the  modifiability 
of  the  design  is  with  rigid  wire  layout.  This  technique  treats  most  wires  as 
rigid,  pre-laid  out  components,  and  components  are  placed  under  the  wiring. 
A  typical  example  is  a  PLA  generator.  A  variation  of  this  scheme  lays  out 
tracks  in  metal  for  horizontal  routing,  and  vertical  routing  in  poly  or 
diffusion.    An  example  of  this  is  SLAP  [Reiss82]. 
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2.4.3.  Route  through 

Mead   and  Conway    have   observed   that   wires   dominate   VLSI   design. 
Hence,  good  planning  of  wiring  is  essential  to  achieving  high  quality  layout. 
One  technique  to  enhance   the  degree  of  wirability  is  to  make  cells  loose 
enough  to  allow  unrelated  connections  to  pass  through  rather  than  having  to 
go  around  a  cell.    Since  wires  will  dominate,  topological  planning  of  wires 
dominates   in   this  design    style   with   cells   being   interleaved    underneath. 
Anceau  and  Reis  [Anceau82]  have  examined  this  as  a  metric  which  is  well 
matched  to  density  in  various  commercial  designs.    Since  this  seems  to  be  a 
better  metric  than  transistor  density,  they  argue  that  this  measure  should 
be  optimized.    Their  motto  is  "wires  first,  transistors  under."  This  method  is 
more  complex  than  the  previous  one,  since  it  involves  modifications  of  com- 
pleted cell  design.    However,  the  VLSI  dictum  of  "wires  dominate,  not  active 
components"  is  well  supported  by  their  empirical  research. 

2.4.4.  By  abutment 

Another  popular  design  strategy  is  to  produce  cells  which  can  be  wired 
by  simply  abutting  two  connecting  cells.  This  approach  requires  both  tight 
topological  and  tight  geometric  design.  Clearly,  for  highly  replicated  cells 
this  is  the  preferred  method  since  wiring  dominates  in  VLSI  and  the  wiring 
(and  its  combination  with  logic)  are  carefully  thought  out.  The  quality  of 
these   cells    are    an    important    determinate    to   the   overall    size   and    power 
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consumption    of  the   chip.     However,    the   speed    of  the   chip   is   much   more 
dependent  on  the  less-regular  controlling  logic. 
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CHAPTER    3 


The  GENERIC  Layout  System 


3.   Introduction 

In  this  chapter  the  overall  structiire  of  the  GENERIC  Layout  System 
iGLS)  is  described,  including  several  components  which  have  not  been 
implemented.  The  software  architecture  for  all  of  GLS  is  described  to  indi- 
cate ovir  design  philosophy. 

3.1.    Layers  of  GLS 

The  overall  structure  of  GLS  is  as  a  layered  system  as  shown  in  figure 
3.1. 
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Figure  3.1  —  Structure  of  GLS 


The  programming  language  GENERIC  is  the  base  layer  of  the  system,  and 
is  the  central  topic  of  this  thesis.  GENERIC  supports  the  basic  data  struc- 
tures used  to  describe  several  views  of  a  VLSI  design,  from  circuit  to  layout. 
In  addition,  GENERIC  provides  operators  for  the  manipulation  of  topologies 
and  topographies.  These  operators  are  used  both  to  derive  and  to  improve  a 
layout. 

On  top  of  GENERIC  is  built  the  Flexcell  library,  which  is  a  collection  of 
cell  generators.  Flexcell's  cell  generators  are  intended  to  produce  the  entire 
rsinge  of  cells  which  might  be  found  in  a  standard  cell  library.  Traditional 
standard  cells  contain  tightly  interleaved  circuit,  topology  and  geometric 
information.     In  Flexcell.   these  three  domains  are  decoupled.     For  each  cir- 


48 


cviit  there  are  several  different  topologies;  the  best  one  depends  both  on  the 
circuit  parameters  and  on  the  other  cells  to  which  it  is  to  be  connected. 
Flexcells  topologies  are  encoded  in  the  generator,  and  can  be  selected  either 
at  cell  generation  time  or  later  on  in  the  design  process.  Moreover,  even 
after  a  topology  is  selected  the  cell  may  be  modified  both  topologicals  and 
geometrically. 

Cell  manipulators  are  programs  for  modifying  the  layout  of  cells  to  meet 
specific  criteria.  One  cell  manipulator  might  enhance  circuit  performance  by 
re-routing  high  capacitance  wires  on  the  critical  path  into  metal.  Another 
might  improve  the  density  of  the  layout  by  compaction  or  aspect  ratio 
modification.  We  believe  that  the  ability  to  modify  existing  circuits,  while 
maintaining  correctness  is  crucial  to  the  generation  of  high  quality,  irregu- 
lar cells.  The  embedded  topologies  provide  standard  "good"  starting  plans 
for  the  particular  circuit,  while  delaying  decisions  on  the  exact  shape  or 
even  exact  topology  until  the  cell  is  composed  with  other  cells. 

The  final  layer  in  the  software  architect\ire  is  a  general  purpose  silicon 
compiler. 

3.1.1.    GENERIC:  the  programming  language 

GENERIC  is  the  programming  language  in  which  GLS  is  written.  It  is 
a  standard  high  level  language,  at  approximately  the  level  of  Lisp  [Steele84] 
and    provides    automatic    storage    allocation    and    garbage    collection,    lists. 
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structural  naming  mechanisms,  support  for  design  integrity,  and  support  for 
hierarchical  structuring.  The  language  is  a  general  purpose  programming 
language  with  VLSI  extensions  contained  in  a  runtime  library.  The 
language  also  contains  some  unique  features  for  building  the  hierarchical 
structures  necessary  to  organize  the  design. 

We  choose  to  design  a  language  (instead  of  just  a  runtime  system)  to 
enable  the  support  and  ensure  the  integrity  of  features  described  above. 
Equally  important  was  the  abihty  to  use  compile  time  analysis  of  the  pro- 
gram to  optimize  runtime  performance. 

GENERIC  is  hierarchical.  The  hierarchical  design  structure  is  the 
result  of  the  calling  structure  of  the  program.  Functions  called  generators 
produce  C-like  structures  [Kernighan78]  representing  different  parts  of  the 
design,  for  example  a  wire  segment  or  a  cell.  The  nodes  in  the  hierarchical 
tree  are  all  cells.  Cells  contain  primitives,  such  as  a  transistor  or  a  contact 
cut,  and  other  cells,  sometimes  called  subcells.  Wires  are  represented  as  a 
graph  of  wire  segments;  however  wires  which  are  used  only  interior  to  a  cell 
are  sometimes  considered  as  part  of  that  the  cell.  The  hierarchical  structure 
of  the  design  is  formed  by  creating  (recursively)  subcells  within  a  given  cell 
by  calhng  cell  generators  within  a  cell  generator.  The  language  ensures 
that  the  various  data  structures  are  connected  together  properly. 
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The  dominant  information  in  GENERIC  is  the  circuit  being  created. 
This  circuit  is  initially  produced  with  a  layout  which  is  design  rule  correct 
by  generic's  extended  design  rules,  but  which  may  be  incomplete  or 
.  unmanufacturable.  The  process  of  running  GENERIC  successively  adds 
information  until  the  circuit  is  complete.  Using  the  VLSI  layout  operators 
in  GENERIC,  the  layout  is  transformed  into  one  which  can  be  manufac- 
tured. 

The  operators  in  GENERIC  are  both  design  rule  safe  and  connectivity 
maintaining.  A  unique  mechanism  called  planes  enables  topological  mani- 
pulations which  are  design  rule  safe.  Moreover,  these  manipulations  also 
maintain  wirability. 

Since  layouts  are  derived  from  circuits,  the  circuit  can  be  validated 
before  layout.  Assuming  mild  restrictions  (i.e.  Mead/Conway  design  rules), 
the  layout  derived  from  the  initial  circuit  is  guaranteed  to  implement  the 
circuit.  It  is  trivial  to  produce  a  manufacturable  layout  through  a  merging 
of  planes,  although  this  would  not  be  optimal.  However,  the  purpose  of  the 
planes  is  to  allow  the  development  tools  and  methodologies  to  produce  high 
quality  circuits.  Hence,  it  is  easy  to  derive  correct  and  manufacturable  lay- 
outs. We  believe  the  power  of  abstraction  provided  by  GENERIC'S  struc- 
tuires  and  operators  enable  the  production  of  high  quality  circuits  as  well. 
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3.1.2.    Flexcell:  the  cell  library  system 

The  next  layer  of  the  system  is  the  Flexcell  library.  Although  GLS  is  as 
technology  independent  as  pwssible,  there  are  inevitable  technolog>-  depen- 
dencies. In  GENERIC,  these  dependencies  are  contained  at  two  places;  the 
geometric  design  rule  spacing  and  the  formation  of  the  primitives,  such  as 
transistors  and  contacts.  The  low  level  technology  dependencies  are  all 
encoded  at  the  GENERIC  layer. 

At  the  Flexcell  level  the  critical  technology  dependencies  are  embedded 
in  the  circuit  designs.  However,  there  is  a  qualitative  form  of  technology 
dependence.  The  qualitative  technology  dependence  represents  learned  good 
design  techniques,  for  example  a  particularly  compact  way  to  layout  a  PLA 
in  nMOS.  These  qualitative  or  indirect  dependencies  do  not  effect  the 
correctness  of  a  design;  only  its  optimality.  At  the  Flexcell  level,  these 
include  topologies  for  a  circuit  in  the  library.  Thus,  use  is  made  of  the  con- 
siderable heritage  of  clever  layouts  for  various  highly  used  cells.  We  have 
encoded  these  as  topological  templates  in  the  cell  generation. 

The  Flexcell  library  is  intended  to  encode  only  technology-  dependencies. 
Hence,  circuits  produced  by  Flexcell  generators  will  typically  not  be  com- 
pacted since  this  can  just  as  easily  be  performed  by  a  general  purpose  tool. 
A  design  goal  of  this  system  is  to  distill  out  any  general  purpose  layout  tools 
from  this  layer  so  that  the  cell  libraries  are  as  small  and  as  flexible  as  possi- 
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ble. 

The  tools  which  are  distilled  become  part  of  the  cell  manipulator  layer. 

3.1.3.    Cell  manipulators 

The  Flexcell  system  is  almost  completely  composed  of  technology  depen- 
dent functions  which  are  either  critical  or  qualitative.  Since  Flexcell 
represents  the  specific  technology  information,  and  hence  must  be  re-written 
for  new  technologies,  it  is  desirable  to  separate  the  technology  independent 
information  and  to  provide  a  level  of  composition  tools.  This  is  the  role  of 
the  cell  manipulator  layer. 

One  type  of  cell  manipulator  improves  the  quality  of  the  design.  Since 
different  metrics  of  quality  exist  for  different  technologies,  it  is  desirable  to 
have  many  small'  tools,  a  variety  of  which  are  applicable  in  a  particular 
situation.  The  applicable  tools  depend  both  on  the  technology  and  the 
design  style.  Hence,  planarity  tools  would  probably  be  less  important  in  a 
technology  with  4  metal  layers  than  in  a  standard  1  metal  layer  nMOS  pro- 
cess. 

A  second  type  of  tool  provides  metrics  indicating  the  quality  of  the 
design.  The  standard  metric  that  exists  now  is  transistors  per  square  mil. 
A  better  measure  would  be  functionality  per  unit  area.  However,  some  lay- 
out methodologies  fsuch  as  function  blocks)  have  high  transistor  densities, 
but   may  be  an   inappropriate    method   to  implement    a  particular   function. 
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Since   different   designs    have    different    goals,    and   hence    require    different 
tradeoffs,  one  metric  will  not  capture  the  quality  of  design. 

To  improve  a  design  automatically,  metrics  are  needed  to  determine 
which  parts  of  the  design  need  improvement.  Preferably,  they  will  also  nar- 
row the  range  of  strategies  employed  to  improve  the  quality  of  the  design. 
Until  our  CAD  systems  are  sufficiently  flexible  for  these  metrics  to  be 
employed,  we  are  unlikely  to  develop  meaningful  measures. 
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CHAPTER    4 


Language  Principles  and  Structures 


4.   Language  Principles  and  Structures 

This  chapter  gives  a  summary  of  the  programming  language  GENERIC. 
For  a  more  detailed  description  see  [Solworth85]. 

4.1.    Language  goals 

The  view  that  we  have  taken  in  GENERIC  is  to  consider  the  layout  pro- 
cess as  a  computation.  Unlike  in  traditional  silicon  compilers,  there  is  no 
algorithm  which  is  hardwired  into  the  system.  The  layout  is  directly  con- 
trolled by  the  designer,  who  can  construct  an  appropriate  layout.  When  this 
controllability  is  combined  with  the  standard  cell  libraries  it  yields  a  very 
flexible  layout  system.  Furthermore,  an  existing  layout  can  be  modified  by 
the  application  of  operators  to  the  existing  circuit.  This  layout  modification 
is  not  only  geometric  but  topological,  and  we  allow  the  full  topological  mani- 
pulation without  ever  violating  our  extended  design  rules. 

4.L1.    Repetition,  Hierarchy,  and  Abstraction 

It  has  been  pointed  out  that  the  only  way  to  manage  the  complexity  of 
VLSI  design  is  through  the  hierarchical  structuring  of  solutions.    It  must  be 
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possible  to  look  at  a  piece  of  the  design,  and  understand  it  and  its  interface 
to  the  rest  of  the  design.  The  representation  should  support  the  appropriate 
abstract  views  and  annotated  interfaces  so  that  components  can  be  under- 
stood in  isolation.  For  VLSI,  the  abstract  views  include  circuit,  logic,  and 
layout.  The  interface  includes  power  and  ground,  signals  and  their  types, 
and  restrictions  on  usage.    These  are  the  static  hierarchical  techniques. 

A  larger  issue  is  the  dynamic  structuring  used  for  the  creation  and 
modification  of  the  design.  Because  of  the  complexity  of  VLSI  design  and 
the  diversity  of  design  issues,  it  is  unlikely  that  the  initial  layout  of  the  chip 
will  be  satisfactory.  Moreover,  the  cost  of  performing  high-quality  initial 
generation  is  likely  to  be  very  high  —  modifications  must  be  possible  which 
are  cheaper  than  recomputing  the  whole  layout.  Hence,  future  design  sys- 
tems will  be  required  to  support  incremental  modification. 

It  is  necessary  to  reduce  the  complexity  of  design.  The  simplest  method. 
supported  by  all  layout  systems,  is  to  define  an  elementary  component  and 
then  replicate  it  massively.  The  resulting  regular  design  enables  a  large 
amount  of  attention  to  be  spent  on  the  base  cell  and  its  interconnect.  It  also 
makes  the  design  easier  to  understand.  Repetition  has  been  extensively  and 
effectively  implemented  in  all  layout  tools,  especially  in  generators. 

Hierarchy  is  used  to  define  the  scope  of  operation.  It  can  be  used  to  res- 
trict the  focus  of  an  operator   to  a  limited  part  of  the  design  or  to  specify 
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aggregates  upon  which  to  apply  the  operator.  In  selecting  a  hierarchy  it  is 
important  to  decide  what  the  hierarchy  means  and  how  it  meshes  with 
design  views.  Classical  systems  require  that  the  same  hierarchy  exist  for 
physical  as  well  as  logical  design.  Hence,  an  adder  which  is  composed  of 
exclusive -ors,  ands,  and  ors  at  the  logical  level  would  require  each  gate  to  be 
laid  out  in  separate  regions.  Separate  layout  regions  mirror  the  structure  of 
separate  gates  in  the  logic  view.  To  circumvent  such  restrictions,  adders  can 
be  defined  as  primitive  logic  elements,  and  then  all  degrees  of  freedom  in 
layout  are  available.  While  this  certainly  fixes  the  problem,  it  is  preferable 
to  derive  such  cells  from  component  logic  parts. 

In  GENERIC,  the  dominant  hierarchy  is  the  circuit  and  from  the  circuit 
any  layout  can  be  realized,  including  the  physical  interleaving  of  logical  dev- 
ices. Unfortunately,  this  both  reduces  the  locality  and  increases  the  size  of 
computation.  We  expect  that  hierarchical  mismatches,  while  very  desirable 
to  support,  will  in  practice  occur  seldom  and  the  design  will  exhibit  most  of 
the  locality  of  the  more  rigid  technique.  Hence,  by  writing  the  code  to 
assume  the  match  of  hierarchy  (and  trapping  out  if  this  is  not  the  case), 
there  should  only  a  minor  loss  of  efficiency. 

An  abstraction  is  a  view  that  may  not  be  entirely  accurate  but  is  simple 
and  reasonably  represents  an  interesting  domain.  Since  abstraction  is  a 
simplification,    it    reduces    the    detail.     It    also    simplifies    the    the   effect    of 
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actions.  An  important  simplification  in  GENERIC  is  that  operators  are  both 
design  rule  preserving  (in  our  extended  sense)  and  wire  connectivity  main- 
taining. Still,  without  a  graphics  terminal  the  designer  is  "blind"  and  must 
keep  the  design  in  his  mind's  eye.  The  mechanisms  in  GENERIC  are 
insufficient  to  do  this  -  hence,  GENERIC  is  used  as  the  kernel  of  a  larger 
system. 

To  compensate  for  the  blindness  we  need  metrics,  planning  and  meta- 
operations.    These  are  beyond  the  scope  of  this  work. 

4.1.2.    Naming 

On  large  designs  it  must  be  possible  to  specify  the  component  pieces. 
This  requires  a  methodology  for  naming.  This  naming  process  is  inherently 
textual  and  redundant.  For  example,  named  components  on  an  enhance- 
ment transistor  named  et  might  be  source,  drain,  gate  and  gate2  (and 
denoted  et.source,  etc).  Different  names  are  needed  at  different  levels  of  the 
hierarchy;  for  example,  if  et  is  used  as  part  of  an  inverter  (named  inu),  then 
inv.in  might  be  an  alias  for  inv.et.gate.  Naming  is  better  suited  to  a  pro- 
gramming language.  Two  problems  result  when  names  are  used  in  a  graph- 
ics editor:  there  is  a  clutter  of  textual  information  on  the  screen,  and  it  is 
difficult  to  associate  a  name  with  a  level  in  the  hierarchy  (since  the  picture 
flattens  several  levels  of  the  hierarchv). 


58 


A  second  requirement  is  to  categorize  existing  textual  information  into 
object  names,  type,  and  scope.  Programming  language  mechanisms  are  well 
developed  to  support  these  needs  syntactically  and  to  check  them  at  compile 
time.  If  more  complex  structures  are  required,  they  can  be  easily  con- 
structed within  the  programming  language. 

The  third  requirement  is  to  be  able  to  convey  design  intent  to  other 
CAD  tools.  Communicating  to  simulators,  timing  analyzers  and  circuit 
analysis  tools  requires  dictions  for  concisely  and  completely  specifying  inter- 
faces. A  common  method  of  annotating  designs  is  by  the  use  of  (psuedo) 
labels,  which  is  a  naming  mechanism.  Notations  decrease  computational 
requirements,  decrease  spurious  errors  and  enable  better  integration 
between  existing  tools.  For  example,  the  timing  analyzer  Crystal 
[OusterhoutSS]  can  experience  exponential  blowup  (and  return  inaccurate 
results)  if  its  analysis  of  the  number  of  paths  is  not  refined.  Crystal  is 
unable  to  determine  when  some  paths  are  mutually  exclusive  and  hence 
requires  directives  which  declare  the  direction  of  information  flows  in  a 
transistor  (since  MOS  transistors  are  inherently  bidirectional). 

4.1.3.   Portability 

Mead/Conway  design  rules  have  served  university  and  industry  designs 
well   for  low   volume   production    integrated    circuits.     However,    aggressive 

designs  will  require  access  to  highly  tuned,  fabrication  line  dependent  rules. 
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In  order  to  be  able  to  use  different  fabrication   lines,   and  to  enable   more 
effective  scaling,  GENERIC  is  design  rule  independent. 

Layout  is  viewed  as  a  computation,  and  the  programs  to  compute  the 
layouts  are  written  to  be  largely  independent  of  the  design  rules.  It  is  possi- 
ble to  design  a  circuit  for  one  fabrication  line,  debug  it,  and  then  re-execute 
the  program  using  different  design  rules  for  a  better  and  more  expensive  fab 
line.  This  same  facility  enables  the  scaling  of  integrated  circuits  with 
shrinking  feature  size.  Scaling  is  a  non-trivial  task,  since  features  do  not  all 
shrink  uniformly,  certain  problem  become  more  acute  (metal  migration, 
power  dissipation)  and  performance  balances  start  to  shift.  It  is  much  easier 
to  change  the  source  code  (GENERIC  program)  than  to  modify  the  object 
code  (mask  level  artifact). 

4.2.    Validation  vs.  Correct-by-design 

Early  design  automation  relied  extensively  on  verification  techniques  to 
ensure  that  circuits  would  work  when  fabricated.  Geometric  and  electrical 
design  rules,  simulation,  net  list  comparison  and  physical  probing  are  among 
the  earliest  techniques  to  ensure  correctness.  These  techniques  are  effective 
in  catching  bugs  but  they  have  several  drawbacks. 

The  first  problem  is  that  errors  are  caught  late  in  the  design  cycle.  For 
example,  early  design  rule  checkers  returned  errors  after  a  design  or  a  com- 
ponent was  completel\   laid  out     Later  design  rule  checkers,  such  as  Magic's. 
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performed  design  rule  checks  interactively,  providing  faster  response  time. 
While  much  more  effective,  there  is  still  a  lag  between  creation  and  error 
detection:  it  would  be  better  if  it  was  impossible  for  the  error  to  occur. 

The  second  problem  is  a  correspondence  problem.  Once  an  error  is 
detected,  it  must  be  understood  in  terms  of  the  source  level  representation 
and  it  must  be  fixed.  Source  level  means  the  representation  in  which  the 
error  occurs.  For  example,  when  a  layout  does  not  match  the  circuit  descrip- 
tion, the  layout  usually  must  be  fixed.  This  involves  the  comparison  of  two 
descriptions,  and  the  "interpreting"  of  the  layout  as  a  circuit.  Finally,  a 
change  must  be  determined  that  will  bring  the  two  descriptions  into 
correspondence.  Sometimes,  the  circuit  is  changed  by  the  layout  designer 
(without  changing  the  logic  it  implements)  to  obtain  a  tighter  layout.  The 
decouphng  of  circuit  and  layout  descriptions  makes  the  process  tedious;  an 
integrated  description  removes  the  correspondence  problem. 

4.3.   The  structure  of  GENERIC 

In  this  section  the  major  data  structures  of  GENERIC  are  described, 
and  their  purpose  is  illustrated. 

In  GENERIC,  an  integrated  circuit  design  is  represented  by  a  data 
structure  which  is  created  during  the  execution  of  the  program.  The  struc- 
ture is  hierarchical  and  is  composed  of  cells. 
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The  cell  hierarchy  is  created  dynamically.  An  implicit  first  procedure  to 
be  called  in  a  GENERIC  program  is  a  cell  generator  which  will  create  the 
root  cell.  Whenever  another  cell  generator  is  called,  the  runtime  stack  is 
examined  to  determine  which  was  the  more  recent  uncompleted  cell  genera- 
tor call;  the  cell  corresponding  to  that  cell  generator  is  then  made  into  the 
parent  of  the  cxirrent  cell.  Hence  the  structure  is  incrementally  built  up  as 
the  program  executes. 

We  will  now  describe  the  data  structures  in  GENERIC  in  greater  detail. 

4.3.1.    Primitives,  Wires  and  Cells 

Integrated  circuit  fabrication  creates  wires  and  devices  through  a  litho- 
graphic process.  Devices  are  formed  through  the  interaction  of  different 
layers;  for  example,  self-aligned  transistors  are  formed  by  crossing  of  polysil- 
icon  and  diffusion.  Thus,  the  manufacturing  representation  is  based  upon 
mask  steps  and  there  is  very  little  structure  in  the  masks.  Early  VLSI  CAD 
tools  adopted  the  manufacturing  representation  as  the  computational 
representation. 

Later  systems  were  more  object  based.  In  GENERIC,  the  devices  are 
produced  by  parameterized  routines  and  thereafter  these  devices  have  fixed 
shape.  These  devices  (transistors  and  contacts)  su-e  called  primitives.  Primi- 
tives can  be  thought  of  as  circuit  elements,  for  example  an  enhancement 
transistor.     They    may   also   be   considered    as   layout.     Thus,   the   structure 
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contains  both  electrical  specification  and  physical  shape  and  location.  In 
addition  to  transistors,  primitives  also  include  vias  between  layers  and  capa- 
citors. 

Primitives  contain  one  or  more  equipotential  regions  (also  called  nets). 
These  regions  may  be  connected  to  other  regions  on  the  same  or  different 
primitive.    Assuming  that  the  pull_up  and  pulLdown  have  been  created, 

connecti pull-up.source,  pull-down.drain) 

creates  the  electrical  connection  between  two  regions.  Since  the  pulLup  is 
defined  as  having  its  drain  connected  to  Vdd  (and  the  pull—down  with  its 
source  connected  to  Gnd)  the  inverter  is  electrically  complete,  and  can  be 
simulated. 

Only  after  the  circuit  has  been  verified  to  implement,  the  logic  function, 
is  it  worth  performing  the  layout.  An  inverter's  layout  is  computed  by  first 
aligning  the  pull-up. source  with  the  pull-down. drain.  The  two  points  are 
then  wired  together.  A  wire  is  a  physical  realization  of  the  electrical  con- 
nection, and  has  width,  layer  and  length  and  location  information.  The 
inverter  is  now  physically  (and  electrically)  complete. 

It  is  possible  to  design  and  layout  the  entire  chip  in  this  manner.  How- 
ever, the  result  is  a  flat  description  of  the  design.  A  preferable  description 
would  be  hierarchical.  The  hierarchical  elements  in  GENERIC  are  cells. 
Cells  are  collections  of  primitives  and  other  cells,  sometimes  called  subcells. 
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Wires  form  a  graph  on  top  of  the  tree  of  cells;  however,  for  some  purposes 
wires  are  included  as  part  of  a  cell  when  all  primitives  on  which  the  wires 
are  incident  on  are  part  (transitively)  of  the  cell. 

4.3.1.1.    Primitives 

A  primitive  is  produced  by  a  generator,  which  is  a  procedure  which 
inherently  creates  an  object.  For  example,  the  skeleton  for  a  primitive  gen- 
erator of  a  polysilcion  to  diffusion  contact  is  given  below: 

PrimitiveT 

pd_contact(FLOAT  width,  FLOAT  height) 

end; 

where  PrimitiveT  is  a  C  style  structure  which  contains  a  number  of  declared  ■ 
fields  plus  a  number  of  impromptu  fields.    The  declared  fields  contain  infor- 
mation common  to  all  primitives,  such  as  geometry,  points  on  the  primitive, 
and  so  forth.    Impromptu  fieids  include  information  common  to  a  subset  of 
the  primitives,  such  as  length,  width,  and  the  label  source. 

Some  fields  are  used  just  for  computation.  We  will  discuss  these  fields 
under  the  appropriate  computation  description,  or  if  their  use  is  obscure,  we 
will  omit  their  discussion. 

The  first  primitive  field  of  interest  is  the  Sparent  field,  which  is  a 
pointer  to  the  cell  which  contains  the  primitive. 
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The  second  field  is  the  list  of  geometry  which  form  the  physical  primi- 
tive.   This  geometry  includes  the  layer  and  the  coordinates  for  the  lower  left- 
hand   and   upper   right   hand   corners   of  the  rectangle.     The   only   type   of 
geometry   which  is  manipulatable    in  GENERIC  is  manhattan   rectangles. 
The  geometry  also  includes  two  electrical  net  pieces  of  information,  $equinet 
and   $equi—raw    to   which   the   rectangle    belongs.     $equinet    is   the   current 
electrical    equipotential    net    to   which    the    rectangle    belongs.     $equi-raw 
describes   whether   two  pieces   of  geometry   have  originally    different   nets. 
Two  objects  with  different  $equi-raw    will  have   the  same  $equinet    if  the 
objects  are  electrically  connected  together.  For  example,  in  a  enhancement 
transistor,    the    gate,    source   and   drain    regions    would    each    be   different 
$equL-raw    values.     If  the    gate   was    connected    to   the    source,    then    the 
$equinet  values  of  the  gate  and  source  would  be  the  same.   The  maintenance 
of  $equi-raw  values  allows  the  equinets  to  be  disconnected. 

A  third  field  in  each  primitive  is  the  $conn^region.  Each  place  where  a 
wire  can  be  connected  to  the  primitive  is  specified  by  a  connection  region. 
The  use  of  connection  regions  inhibits  connections,  for  example  in  the  middle 
of  a  buried  contact.  These  regions  specify  the  layer  of  the  wire,  and  a  rec- 
tangle in  which  the  end  of  the  wire  must  lie.  Thus  a  wire  which  is  four 
units  wide  can  only  be  attached  to  a  rectangle  with  a  side  which  is  of  length 
greater  than  or  equal  to  four.  GENERIC'S  extensive  naming  mechanisms 
allow  objects  to  be  easily  described. 
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Finally,   there  are  some  named  points  on  the  primitive   which  will  be 
useful  in  manipulations  and  connections  to  the  primitive. 

» 

4.3.1.2.    Wires 

Wires  are  formed  by  collections  of  wire  segments.  Each  segment  con- 
tains two  endpoints,  a  width,  and  an  $equinet. 

Wire  segments  may  be  connected  to  other  wire  segments  at  their  end- 
points.  To  perform  the  connection,  the  two  wires  must  have  the  same 
$equinel  or  at  least  one  of  the  wires  must  have  the  null  $equinet.  Any 
number  of  wires  may  be  connected  to  the  endpoint  of  a  wire.  In  addition,  a 
wire  can  connect  to  a  geometric  region  (and  within  a  Sconn^region)  of  a 
primitive. 

The  physical  connection  of  two  wires  or  a  wire  and  the  geometry  within 
a  primitive  is  called  a  pin.  Pinning  not  only  ensures  that  the  endpoints  of 
the  wire  have  the  same  coordinates,  but  also  that  when  the  wire  is 
deformed,  the  object  will  continue  to  have  the  same  coordinates.  A  wire  is 
defined  as  the  closure  of  pins  of  wire  segment  endpoints  vuitil  a  primitive  is 
encountered.  Figure  4.1  shows  two  wires. 
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Figure  4.1  —  two  wires 


4.3.1.3.    CeUs 

Cells  contain  a  psirent,  a  list  of  component  subcells  and  a  list  of  primi- 
tives.   There  are  also  named  points  on  cells. 

4.3.2.    Names  and  points 

Names  and  points  are  ways  of  accessing  and  locating  various  parts  of 
the  design. 

4.3.2.1.    Names 

Inside  a  generator,  a  name  may  be  declared  to  be  either  a  local  or  global 
variable.  If  a  name  is  used  which  is  neither  local  or  global,  than  that  name 
becomes  an  impromptu  field  of  the  structure  being  built  by  that  generator. 
Hence  a  generator  returns  a  structure  which  contains  all  of  the  types 
declared  in  the  structure's  prototype,  plus  fields  corresponding  to  all  of  the 
impromptu    names.     This   enables    the   structure   for   transistors    to   contain 
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source,  drain,  and  gate  names,  as  well  a  channel_width  and  channel-length, 
while  contact  cuts  might  contain  resistance  and  named  points. 

This  mechanism  allows  information  to  be  attached  at  the  appropriate 
level  of  the  hierarchy. 

4.3.2^.    Points 

A  point  is  a  structure  which  contains  an  x-y  coordinate  and  is  part  of  at 
most  one  primitive's  geometry  or  wire  segments.  If  two  wire  segments  are 
connected,  a  point  from  each  are  pinned  together. 

A  point  is  said  to  be  physically  incident  on  the  $geom  or  wire  segment 
which  it  is  part  of,  and  the  point  is  translated  when  the  object  it  is  incident 
on  is  translated. 

4.3.2.3.    PointCell 

Points  represent  coordinates  on  the  lowest  level  physical  objects.  Hence, 
points  specify  both  an  object  and  a  location  on  that  object.  To  specify  loca- 
tions on  non-primitive  objects,  cells,  a  PointCell  type  is  used. 

A  PointCell  contain  two  fields:  the  object  on  which  the  point  is  logically 
incident  and  a  point  which  describe  location  and  physical  incidence. 

Names  may  be  associated  with  points,  pointcells,  or  any  other  object  in 
GENERIC.   The  use  of  pointcells  allows  aggregate  collections  to  be  precisely 

manipulated. 
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4.4.   Operators 

In  general,  there  are  two  alternative  methods  for  deriving  layout,  con- 
straint based  and  operator  based.  Constraint  based  systems  never  modify 
layout  once  the  layout  is  created.  Examples  include  compaction  type  sys- 
tems and  incremental  layout  systems.  A  more  flexible  approach  is  based  on 
operators.  Decisions  are  made  and  applied,  but  decisions  are  not  irrevokable 
—  designs  can  be  continuously  modified  in  a  attempt  to  find  better  quality 
designs. 

Most  non-graphical  VLSI  layout  languages  au-e  constraint  based.  The 
level  of  abstraction  in  these  languages,  to  date,  has  not  been  high  enough  to 
understand  and  make  modifications  to  the  design  by  purely  textual  means. 
In  contrast,  most  graphics  systems  make  much  greater  use  of  the  operator 
mode  since  it  is  more  powerful  —  changes  in  the  design  do  not  require  re- 
work from  scratch.  Using  graphics,  the  problem  of  understanding  the  design 
is  simply  and  effectively  transferred  to  the  designer  across  the  graphics 
interface. 

GENERIC  is  operator  based.  The  increased  level  of  abstraction  present 
in  the  language  is  intended  to  enable  the  effective  combining  of  the  power  of 
operators  and  programming  language  abstraction. 

GENERIC'S     operators     are     intended     to     be     a     balance     between 

effectiveness  and  efficiency.    By  effectiveness,  we  mean  the  ability  to  support 
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any  layout  style.  Hence,  the  operators  must  represent  reasonably  atomic 
actions  so  that  they  can  be  combined  in  different  ways  to  achieve  different 
layouts.  Moreover,  they  must  form  a  complete  or  covering  set,  allowing  any 
layout  to  be  attained  through  at  least  one  series  of  operator  applications. 
They  must  perform  a  task  which  is  both  useful  and  comprehensible  to  the 
designer. 

By  efficiency,  we  mean  that  these  operators  must  be,  in  principle,  capa- 
ble of  performing  with  reasonable  operating  efficiency.  Given  the  size  of  the 
problem,  it  is  desirable  to  have  algorithms  which  work  in  linear  or  quadratic 
time.  We  are  unable  to  measure  the  time  the  operators  in  GENERIC 
require  since  the  system  has  not  been  timed,  and  not  enough  examples  have 
been  nin  to  get  an  empirical  measure. 

Binary  operators  are  of  the  form: 

operandi  operator  operand o 

operandi  operator(parameters}  operand 2 

where  the  first  form  is  one  where  all  the  operator's  parameters  take  their 
default  values.  The  second  form  has  some  parameters  which  are  given  non- 
default  values.  Standard  usage  is  that  operand  ^  is  the  operand  which 
specifies  the  object  to  be  changed  while  operand  2  specifies  the  reference 
operator,  which  specifies  how  operand  ^  is  to  be  changed. 
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In  addition  to  binary  operators,  there  are  unary  operator  of  the  form: 

operator  operand  ^ 

operator{parameters}  operand  ^ 

We  shall  describe  the  operators  and  show  examples  of  their  use.    These 
operators  include  move,  to,  drop,  orient,  and  wrap. 

4.4.1.    Move 

The  binary  operator  move  is  a  sliding  operator.  The  set  of  objects  to  be 
maved  is  called  the  translation  set,  and  is  initialized  to  the  set  of  objects 
specified  by  operandi-  The  move  operation  takes  a  number  of  steps.  On 
each  step,  the  translation  set  is  moved  the  minimum  of  the  total  distance  to 
be  moved  or  to  the  point  where  an  obstruction  is  encountered.  If  an  obstruc- 
tion is  encountered,  and  the  obstruction  is  a  wire  then  the  wire  can  be 
deformed  to  allow  the  translation  set  to  continue  moving.  The  number  of 
ways  in  which  an  obstruction  can  be  overcome  is  extensive  and  is  detailed  in 
the  chapter  on  wire  deformations.  Alternatively,  if  the  obstruction  cannot 
be  deformed  (because  it  is  not  a  wire  or  the  conditions  for  deformation  are 
not  satisfied),  then  move  is  complete.  The  objects  obstructing  movement  are 
returned  in  the  variable  tt. 

An  example  of  a  move  operator  is  shown  in  figure  4.2.    Figure  4. 2. a 
shows  the  initial  layout.    The  transistor  is  to  be  moved  to  the  right  by  some 

large  distance,  but  before  that  distance  is  moved,  the  contact  cut  is  reached 

71 


(4.2.b).  Since  the  contact  cut  is  not  deformable,  the  move  operator  ter- 
minates and  the  global  variable  tt  contains  a  singleton  tuple  consisting  of 
the  contact  cut. 


/gx  (b)  transistor  moved  right 

Figure  4.2  —  Example  of  move  operator 


A  second  example  shows  the  effect  of  wires  deforming  (figure  4.3).  The 
contact  is  moved  to  the  right  until  the  wire  is  encountered  (figure  4.3.b). 
The  wire  is  then  deformed,  by  adding  jogs,  and  in  the  second  step  the  object 
is  moved  its  desired  distance  (figure  4.3.c).  The  total  effect  shown  in  figure 
4.3  is  the  result  of  one  operator  application. 
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(a)  (b)  contact  reaches        (c)  wire  is  deformed 

wire  by  contact 

Figure  4.3  —  Wire  deformation. 


The  move  operator  has  several  variants:  moveJeft,  movej"ight, 
move_up.  move_down,  move_closer,  and  move_further.  The  first  four 
variants  specify  the  direction  in  which  operand  ^  is  to  be  moved.  The  direc- 
tion on  the  last  two  variants  is  calculated  from  the  relative  location  of 
operandi  and  operand  2  in  the  obvious  manner.  The  following  are 
equivalent: 

op  I  move{direction:  =  LEFT}        op  2 
op  I  move-left  op  2 

Moreover  if  opo  is  to  the  left  of  op^,  then  the  following  is  also  equivalent. 
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op  I  move  op  2 

One  more  point.  The  operands  indicate  two  things:  the  set  of  objects  to 
be  moved,  and  reference  points  on  the  objects  for  calculating  distance  and 
direction. 

4.4.2.    To 

The  operator  to  is  a  placement  operator.  Once  again,  the  set  of  objects 
to  be  moved  is  called  the  translation  set.  Unlike  the  move  operator,  objects 
which  obstruct  movement  are  just  passed  over  (and  the  topology  is  changed). 
The  direction  specified  by  the  operator  is  the  side  of  operand 2  that  operandi 
is  placed  on.  For  example,  in  figure  4.4  the  to_right  operator  is  applied  to 
the  contact  cut  which  places  it  to  the  right  of  the  wire.  Compare  this  with 
the  operation  of  the  move  operator  sho\^Ti  in  fig\ire  4.3. 
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(a) 


Figure  4.4  —  Example  of  the  to  operator. 


(b)  contact  placed  to 
right  of  wire 


This  operator  is  more  problematic  to  implement  than  move.  One  prob- 
lem is  that  the  object  being  moved  might  not  fit  where  it  is  intended  to  be 
placed,  even  if  wires  are  deformed.  Another  problem  is  that  the  wires  which 
are  being  deformed  might  cause  design  rule  problems  with  other  geometry  if 
our  goal  of  maintaining  connectivity  is  to  be  maintained. 

Like  the  move  operator,  to  is  effected  in  a  number  of  steps.  Each  step 
moves  the  translation  set,  and  if  the  rule  is  safe;  will  record  it  as  a  possible 
valid  move.  This  allows  the  to  operator  to  temporarily  have  a  unsafe  state, 
to  reach  a  better  valid  state. 
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Wires  are  deformed  only  in  ways  which  do  not  create  design  rule  viola- 
tions. Hence,  intermediate  jumps  of  the  form  shown  in  figure  4.5  are  not 
allowed. 


rj-^^j^  ■X^'OOC^ 


(a)  contact  stationary,  everything  else  moving  towards  right 


(b)  illegal  intermediate  position  of  contact  cut  over  wire 

Figure  4.5  —  Illegal  intermediate  position  for  wires. 

As  can  be  readily  seen,  the  complexity  from  the  designers  viewpoint  of 
this  operator  is  markedly  increased  because  of  the  added  strain  of  topologi- 
cal    manipulation.      We    shall    show    a    mechanism    called    planes     which 
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significantly  decreases  the  complexity  of  this  operation. 

4.4.3.  Align 

To  moves  an  object  to  one  side  of  a  reference  object.  A  similar  operator 
align,  moves  an  object  horizontally  or  vertically  until  the  point  specified  on 
the  object  matches  the  point  specified  on  the  reference  object. 

This  operator  is  used  to  center  objects  over  each  other.  For  example,  a 
poly-metal  contact  could  be  centered  over  a  metal  wire.  It  is  also-  used  as  a 
placement  operator;  the  pull_up.source  can  be  aligned  with  the  pull—down 
drain,  in  the  usual  manner. 

Otherwise,  this  operator  is  similar  in  effect  to  the  to  operator. 

4.4.4.  Drop 

The  equivalent  to  the  drop  operator  is  not  found  in  other  VLSI  CAD 
systems.  The  drop  operator  is  a  technique  for  manipulating  an  extra  degree 
of  freedom  called  planes. 

All  geometry  (wire  segments  and  primitives)  have  coordinates  for  their 
location  and  contain  layer  and  shape  information.  In  addition  to  this  infor- 
mation, each  piece  of  geometry  has  a  plane.  Objects  on  different  layers,  for 
example  diffusion  and  polysilicon  have  geometric  design  rule  interactions 
with  each  other.  In  contrast,  objects  on  different  planes  do  not  interact  with 
each  other,  while  objects  on  the  same  plane  do.    Hence,  the  design  rules  are 
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extended  by  another  -J-  dimension. 


n 


2 
1 
0 


Figure  4.6  —  Planes 


Planes  are  numbered  0,  1,  ...  ,  and  form  part  of  GENERIC'S  extended 
design  rules.  A  design  which  uses  multiple  planes  is  not  fabricatable;  a 
necessary  condition  for  fabrication  is  that  all  of  the  geometry  reside  on  plane 
0. 

Planes  enables  arbitrary  topological  manipulations  to  be  performed 
without  ever  violating  design  rules.    We  know  of  no  other  system  which  has 
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this  property. 

Figure  4.7  shows  a  layout  in  circuit  notation.  By  moving  one  of  the 
gates  to  be  interleaved  to  a  new  plsine  (by  the  drop  operator),  that  gate  can 
be  interleaved  with  the  gate  on  the  other  plane  by  the  to  and  move  opera- 
tors. 


(a)  original 


A      plane  2 


(b)  2  planes 


(c)  1  plane  "topology  modified 
Figure  4.7  —  Plane  manipulation  of  circuit 


The  paradigm   used  in  this  design  is  widely   used  in  GENERIC   algo- 
rithms.   The  object  on  the  first  plane  is  spread  apart,  essentially  converting 
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the  layout  into  a  sticks  representation.  There  is  then  plenty  of  room  to 
interleave  the  object  on  plane  2  in  the  desired  way  with  the  object  on  plane 
1,  taking  care  to  assure  that  no  primitive  on  one  plane  lies  at  the  same  coor- 
dinates as  a  wire  on  the  other  plane.  Bridges  are  made  where  wire  crosses 
wire.  Once  these  actions  are  performed,  the  object  can  be  dropped  onto  one 
plane.    Finally,  the  layout  can  be  compacted. 

4.4.5.    Orient  and  mirror 

Operators  are  provided  for  rotating  an  object  by  an  arbitrary  multiple  of 
90°,  and  for  mirroring  an  object  about  the  x  or  y  axis.  These  operators  are 
secondary,  in  comparison  to  the  move  and  to  operators,  and  hence  they  are 
rather  simple. 

Both  operators  disconnect  the  wires  attached  to  the  object,  perform  the 
transformation  and  call  a  wire  router  to  rewire.  In  the  case  of  orient,  if  the 
objects  are  spread  out  enough,  there  is  always  a  single  layer  route,  since  the 
topology  of  the  circuit  is  not  changed. 

Mirroring  changes  the  topology,  so  that  routing  cannot  be  guaranteed. 
However,  any  point  pairs  which  are  not  wired  are  returned  in  the  global 
variable  not-wired. 
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4.4.6.    Wrap 

GENERIC  is  capable  of  manipulating  only  manhattan  geometry.  How- 
ever, it  is  desirable  to  be  able  to  embed  in  GENERIC  designs  cells  which 
have  been  highly  optimized  with  arbitrary  angle  geometry,  for  example  a 
RAM  cell  which  is  heavily  replicated.  Another  desirable  property  is  to  have 
some  cells  not  deformable.  For  example,  in  dynamic  RAMs,  the  bit  is  stored 
as  a  charge  on  a  capacitor.  When  the  bit  is  to  be  read,  that  charge  is  charge 
shared  with  the  read  line,  and  is  picked  up  by  the  sense  amp.  These  designs 
do  not  follow  Mead/Conway  design  rules.  Changing  the  size  (and  hence 
capacitance)  of  the  read  line  will  result  in  a  circuit  which  will  not  work. 
Hence,  it  is  desirable  to  encapsulate  these  layouts  in  a  way  in  which  they 
will  not  be  accidentally  modified. 

This  encapsulation  is  performed  by  an  operator  called  wrap.  In  the 
case  of  non-manhattan  geometry,  a  manhattan  border  is  created  which 
overspecifies  the  geometric  design  rule  borders.  The  object  is  then  immut- 
able. 

Since  GENERIC'S  design  rules  assume  correct  construction,  they  need 
only  insure  that  changes  maintain  design  rule  correctness.  Hence,  the  pres- 
ence of  non-manhattan  geometry  (with  a  manhattan  interface)  causes  no 
special  problems  for  the  system. 
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CHAPTER    5 


Wire  Deformations 


5.   Introduction 

The  various  movement  operators,  move,  to,  and  align,  deform  wires  in 
an  attempt  to  maintain  connectivity.  In  the  following  discussion,  the  term 
translation  set  includes  the  set  of  objects  which  are  moving  at  the  current 
step  within  the  operator.  The  wire  frame  is  the  set  of  wire  segments  which 
connect  the  translation  set  to  objects  outside  the  translation  set.  These  wire 
segments  are  deformed  to  enable  the  translation  set  to  move.  The  environ- 
ment consists  of  all  geometry  which  is  neither  part  of  the  translation  set  nor 
wire  frame.  Finally,  v  is  the  set  of  objects  which  at  the  current  step 
obstruct  movement. 

There  are  several  forms  of  wire  deformations.  Stretch  increases  the 
length  of  a  wire.  Shrink  reduces  the  length  of  a  wire.  Kink  creates  jogs  in 
the  wire  obstructing  motion.  Stair  creates  jogs  in  the  translation  set,  allow- 
ing the  translation  set  to  "flow"  around  surrounding  geometry.  Include  aug- 
ments the  translation  set  by  the  wires  causing  obstruction.  Slide  allows  the 
point  at  which  a  wire  is  connected  to  the  geometry  to  change.  Finally, 
Break  physically    separates   a  wire   into   two   wire   segments   which   are  not 
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pinned  together. 

When  more  than  one  deformation  is  applicable,  the  one  with  highest 
(fixed)  order  of  precedence  is  selected.  Moreover,  each  deformation  can  be 
inhibited  either  by  turning  off  a  global  flag  for  that  deformation,  or  by  turn- 
ing off  a  flag  local  to  the  wire  segment.  This  enables  restrictions  to  be 
applied  both  locally  to  a  particular  design  and  globally  to  restrict  the  design 
style. 

5.1.    Shrink 

A  contact  cut  with  a  wire  attached  to  it  is  shown  in  figure  5.1. a.  If  the 
contact  cut  is  the  only  member  of  the  trsinslation  set,  then  the  wire  will  be 
in  the  wire  frame  set.  Moreover,  if  the  direction  of  movement  is  to  the  right, 
then  the  wire  can  shrink  (up  to  a  maximum  of  the  size  of  the  wire),  and  still 
maintain  connectivity,  as  shown  in  figure  S.l.b.  A  wire  which  is  shrunk  is 
always  part  of  the  wire  frame  set. 


Figure  5.1  —  Wire  shrinks  when  contact  moves  right. 
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5.2.    Stretch 

A  wire  which  will  shrink  when  the  motion  is  in  the  direction  d,  will 
stretch  when  it  is  in  the  direction  d  +  180°  (see  figure  5.2).  A  wire  which  is 
stretched  is  always  part  of  the  wire  frame  set. 


Figure  5.2  -  Wire  stretches  as  contact  moves  to  the  left. 


5.3.    Kink 

A  wire  which  is  orthogonal  to  the  direction  of  motion  and  which 
obstructs  the  movement  (and  hence  is  in  tt)  can  be  kinked.  Kinking  seg- 
ments one  wire  segment  into  as  many  as  5  wire  segments.  One  segment  is 
the  length  along  which  there  was  a  design  rule  conflict.  That  segment  is 
added  to  the  translation  set.  One  or  two  more  wire  segments  are  created 
which  are  of  zero  length  and  become  part  of  the  wire  frame  set.  These  wires 
will  stretch  in  the  next  step  of  the  operator.  Finally,  up  to  two  segments  are 
created  which  are  part  of  the  environment  and  will  remain  stationary.  A 
picture  is  shown  in  figure  5.3. 


84 


Figure  5.3  —  Wire  kinks  as  contact  is  moved  to  the  right. 


5.4.    Stair 

A  wire  in  the  translation  set  which  is  obstructed  by  an  object  in  the 
environment  can  be  staired.  The  wire  must  be  orthogonal  to  the  direction  of 
motion.  Stairs  are  calculated  by  reversing  the  role  of  environment  and 
translation  set  and  computing  a  kink.    An  example  is  shown  in  figure  5.4. 
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Figure  5.4  —  Wire  in  inverter  stairs  as  inverter  is  moved  to  the  right. 


5.5.    Include 


A  wire  which  is  in  the  environment  and  obstructs  motion  can  be  added 
to  the  translation  set. 
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Figure  5.5  —  Vertical  diffusion  wire  is  included  in  the  translation  set. 


5.6.    Slide 

Wires  are  pinned  to  other  wire  segments  at  their  endpoints,  or  they  may 
be  pinned  to  a  piece  of  geometry  on  a  primitive.  If  they  are  pinned  to  a 
primitive,  there  is  frequently  some  rsinge  of  points  at  which  the  wire  can  be 
pinned,  as  defined  by  the  primitives's  connection  regions. 

For  an  nMOS  standard  size  pull-up  shown  in  figure  5.6.  there  is  a  fair 
latitude.    When  the  pull-up  is  moved,  the  wire  need  not. 
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Figure  5.6  —  Wire  slides  along  pull-up  as  inverter  is  moved  down. 


5.7.    Break 

As  an  extreme  case  of  a  wire  deformation,  a  wire  which  cannot  be 
deformed  in  any  of  the  normal  ways  can  be  physically  broken  (and  hence 
lose  connectivity).  Since  it  is  very  easy  to  verify  that  objects  are  physically 
(or  electrically)  connected  in  GENERIC,  there  is  no  problem  of  the  break 
being  forgotten. 
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An  example  of  a  break  is  shovr-n  in  figure  5.7.    These  broken  wires  can 
then  be  reconnected  with  bridges  over  the  offending  wires. 


Figure  5.7  —  Wire  breaks  as  inverter  is  moved  passed  a  wire. 
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CHAPTER    6 


Examples 


6.   Introduction 

In  this  chapter,  examples  are  shown  from  the  nMOS  primitives  library, 
the  nMOS  cell  library,  and  the  set  of  cell  manipulators.  The  example  from 
the  nMOS  cell  library  include  an  inverter  and  a  superbuffer.  A  simple 
operator,  trim  from  the  set  of  cell  manipulators  is  also  shown. 

6.1.    Primitives 

The  nMOS  library  of  primitive  generators  can  produce  all  of  the  devices 
shown  in  [LyonSl].  These  include  contacts  between  any  two  wiring  layers, 
enhancement  and  depletion  transistors,  and  primitive  wires.  Primitive 
wires  differ  from  wire  segments,  and  are  useful  for  performing  certain  types 
of  manipulations  (as  we  shall  see  later  in  this  section).  In  an  analog  process 
capacitors  would  also  be  included.  These  devices  can  be  generated  in  a 
variety  of  sizes;  moreover  they  are  available  in  a  variety  of  shapes. 

Primitives  are  produced  by  procedures  which  are  known  to  be  correct, 
and  hence  need  not  be  checked  by  GENERIC.  Since  primitives  are  part  of 
the  "systems  programming"  for  a  particular  technology,  and  since  the  total 

descnpiion  of  primitives  for  that  technology  consumes  only  a  few  thousand 

90 


lines,  we  are  not  particularly  worried  about  the  ease  of  describing  primi- 
tives. Hence,  little  effort  has  be  provided  to  verify  correctness  or  reduce 
tedium  in  primitive  generators. 

Once  written,  a  generator  can  be  called  on  by  a  user  to  generate  one  of 
a  parameterized  family  of  devices.  In  the  current  implementation  of  GEN- 
ERIC, the  primitives  are  not  deformable.  It  would  be  desirable  to  modify 
the  shape  of,  for  example,  a  transistor  (see  figxire  6.1)  to  fit  tightly  within  a 
layout.  This  is  currently  accomplished  by  deleting  the  primitive  and  replac- 
ing it  with  another  primitive  which  serves  the  same  function. 
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Figure  6.1  —  Two  different  layouts  of  the  same  functional  transistor. 
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A  primitive  consists  of  the  following  components: 

1)  The  equivalence   class  of  circuit  functions  implemented   by  this  primi- 
tive. 

2)  The  physical  geometry  which  composes  the  primitive. 

3)  The  geometric  design  borders  which  will  be  'clashed'  against  other  prim- 
itives and  wires  geometric  design  borders. 

4)  A  set  of  points  which  are  "physically"  incident  on  the  primitive 

5)  The  regions  in  which  wires  may  be  connected  to  the  primitive 

6.1.1.    A  transistor  primitive  generator 

Figure  6.2  shows  the  primitive  generator  for  simple,  straight  transistors 
which  are  either  enhancement  or  depletion  depending  on  the  parameter 
type.   The  other  parameters  are  the  ch-w  and  the  c/i_A 
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transistor 

type  is  either  e  for  enhancement  or  d  for  depletion 

PrimitiveT  $t(float  ch_w,  float  ch_h,  str  type)  $t 

default  ch_w   :=  2.0;         default  ch_h  :=  2.0; 

if  (type  =  'e')  then 

$logical_equiv_class   :=  'enhancement-transistor'; 
elseif  (type  =   d)  then 

$logical_equiv_class   :=  'depletion_transistor'; 
else 

fretum  ('bad  transistor  type'); 
end; 

$path_orientation  :=  'w'; 

trans  :=  GeomCreateCt',  2.0,  2.0,  2.0  +  ch_w,  2.0  +  ch_h); 

pleft  :=  GeomCreateCp',  0.0,  2.0,  2.0,  2.0  +  ch_h); 

prght  :=  GeomCreateCp',  2.0  +  ch_w,  2.0,  4.0  +  ch_w,  2.0  +  ch_h); 

dbot    :=  GeomCreateCd',  2.0,  0.0,  2.0  +  ch_w,    2.0); 

dtop    :=  GeomCreatei'd',  2.0,  2.0  +  ch_h,  2.0  +  ch_w,  4.0  +  ch_h); 

$geom  :=  [  dtop  , 

pleft  ,  trans  ,  prght 

dbot]; 

$path    :=  [[trans.Sxl,  (trans. $yl +trans.$xl)/2.0], 
[trans.$x2,  (trans.$yl  + trans. Sxl)/2.0]]; 

$gdrs  +:=    GdrGeom(dtop)   +  GdrGeom( trans)   +  GdrGeom(dbot)   H 
[GdrCreateCp',  0,  0,  [pleft],  0.0,  2.0, 

4.0  +  ch_w, 
2.0  +  ch_h)]; 
if  (type  =  'd')  then 

ion     :=  GeomCreateCi',  1.0,  0.0,  3.0  +  ch_w,  4.0  +  c}L_h); 
Sgeom  +:=  [ion]; 
Sgdrs  +:=  GdrGeomdon); 
end; 

(./V"/U'  /)"//;/< 
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drain    :=  pointed',  2.0  +  ch_w/2.0,  4.0  +  ch_h,  dtop'; 
source  :=  pointCd',  2.0  +  ch_w/2.0.  0.0.  dbotj; 
gate     :=  pointCp',  0.0,  2.0  +  ch_h/2.0,  pleft); 
gate2    :=  pointCp',  4.0  +  ch_w,  2.0  +  ch_h/2.0,  prght); 

define  connection  region 
drain_conn    :=  ConnRegionCreateCd',  dtop,  2.0,  3.0  +  ch_h, 

2.0  +  ch_w,  4.0  +  ch_h); 
source_conn  :=  ConnRegionCreate('d',  dbot,  2.0,  0.0,  2.0  +  ch_w,  l.O'; 
gate_conn     :=  ConnRegionCreateCp',  pleft,  0.0,  2.0,  1.0,  2.0  +  ch_h); 
gate2_conn    :=  ConnRegionCreateCp",  prght,  4.0  +  ch_w,  2.0, 

4.0  +  ch_w,  2.0  +  ch_h); 

$conn_region  :  =  [source_conn,  drain_conn,  gate_conn,  gate2_conn]; 
equipotential(  [pleft,   prght]); 
end; 

Figure  6.2  —  A  simple  transistor  primitive  generator 


Defaults  are  provided  for  the  parameters  for  ch_w  (channel  width)  ajid 
ch_h  'channel  height),  if  the  call  to  $t  does  not  specify  them.  Since  pro- 
cedure calls  have  parameters  specified  either  by  order  or  by  keyword,  it  is 
sensible  to  specify  defaults  for  an  arbitrary  subset  of  parameters. 

A  logical  equivalence  class  defines  the  type  of  object.  When  the  class  is 
combined  with  some  set  of  device  dependent  parameters,  the  device  is  com- 
pletely characterized  functionally.  Assuming  that  there  is  no  erroneous 
information,  the  devices  having  the  same  eq\iivalence  class  and  device 
dependent  parameters  can  be  substituted  for  each  other  without  re- 
simulation. 
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The  variable  $path^orientation  is  an  impromptu  variable  and  simplifies 
the  computation  of  length-to-width  ratios,  and  other  information  by  transis- 
tor routines. 

The  next  five  assignments  create  the  geometry  that  forms  a  basic 
transistor.  Each  piece  of  geometry  has  a  layer,  and  lower  left  and  upper 
right  rectangular  coordinates.  These  assignments  also  create  5  impromptu 
variables,  which  can  be  used  to  specify  each  piece  of  geometry. 

The  set  of  geometry  consisting  of  these  five  variables  forms  the  physical 
device  and  is  stored  at  the  field  called  $geom.  $Geom  is  the  total  list  of 
geometry  which  must  be  transformed  when  a  linear  transformation  (transla- 
tion, orientation  or  mirroring)  is  applied  to  the  geometry  by  the  operators. 

The,  impromptu  variable  $path  gives  the  path  of  the  center  line  of  the 
transistors  channel  region.  Once  again,  the  purpose  of  this  variable  is  to 
enable  transistor  routines  that  deal  with  geometry  to  work  simply  and  uni- 
formly. 

The  geometric  design  rule  borders  are  defined  in  $gdrs.  These  $gdrs 
will  be  checked  against  those  of  other  primitives  and  wires  during  the  course 
of  operators. 

A  number  of  points  which  are  locations  "physically"  incident  on  the 
primitive  are  given.  These  points  do  not  restrict  the  places  at  which  wires 
can    be    joined     to    the    transistor    —    they     are    just     convenient     naming 
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mechanisms. 

Next  the  connection  regions  are  formed. 

Finally,  every  piece  of  geometry  in  GENERIC  is  assumed  to  inherently 
have  a  different  equipotential  (which  may  become  merged  when  two  objects 
are  connected).  However,  the  two  gate  regions  of  a  transistor  have  the  same 
inherent  (or  raw)  equipotential.  This  is  the  purpose  of  the  last  line  in  the 
generator. 

Implicitly  performed  by  the  primitive  generator  is  the  insertion  of  the 
primitive  as  a  component  of  the  most  recent  currently  "opened"  cell.  Hence, 
the  hierarchical  structure  is  created  without  explicit  designer  structuring. 

6.1.2.    A  wire  primitive  generator 

Wires  in  GENERIC  consist  of  collections  of  wire  segments  which  are 
joined  at  their  endpoints.  This  is  a  convenient  way  to  view  wiring  for  con- 
necting logic  components  in  a  "free  form"  way,  since  these  wires  are  deform- 
able  in  many  ways. 

However,  having  point-to-point  wires  makes  it  difficult  to  {perform  topo- 
logical manipulation  on  heavily  connected  wires  such  as  power  and  ground 
lines.  To  exchange  two  gates  sharing  power  and  ground  lines,  it  is  desirable 
to  change  the  points  at  which  the  gates  are  connected  to  these  lines  rather 
than  modify  the  pre-placed  lines.  For  these  wires,  it  does  not  matter  at 
what    point    the   component    taps   into   the   wire.     Moreover,    the   location    of 
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these  wires  tends  to  be  heavily  constrained  by  the  fact  that  there  are  many 
connections  to  them. 

These  types  of  wires  can  be  implemented  as  primit^'es,  look  like  wires 
(see  figure  6.3),  and  are  produced  by  the  simple  primitive  generator  sho%vn 
in  figvire  6.4. 


Figure  6.3  —  A  metal  wire  primitive 
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wire  primitive 

PrimitiveT  wprimistr  layer,  float  width,  float  height!  Wprim 

$logical_equiv_clas5   :=  'w-ire-prim'; 

describe  geometry 
cgeom    :=  GeomCreate( layer,  0.0,  0.0,  width,  height); 
$geom    :=  [cgeom]; 

in  terms  of  lower  left  and  upper  right  corner  points 
connect     :=  ConnRegionCreatedayer,   cgeom,  0.0,  0.0,  width,  height); 

$conn_region  :  =  [connect]; 
end; 

Figure  6.4  —  Wire  primitive  generator 


All  primitives  are  connected  to  other  primitives  by  wire  segments.  To 
perform  the  topological  change  of  switching  the  location  of  the  two  gates 
shown  in  figure  6.5,  a  simple  to  operator  is  applied  (without  requiring  a 
plane  change).  This  will  cause  the  wires  on  the  gate  to  slide  across  the  con- 
nection region  for  the  wire  primitive.  This  capability  is  of  particular  impor- 
tance in  supporting  heavily  used  global  signals  (clock  phases.  Ground,  Vdd) 
and  strips  type  layout  methodology. 
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Fi^^jre  6.5  -  Topological  manipulation  using  wire  primitive 
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6.2.    Cell  generators 

In  this  section,  two  cell  generators  from  the  Flexcell  library  are  dis- 
cussed. 

6.2.1.    Inverter 

The  simplest  gate  which  can  be  created  in  nMOS  technology'  is  an 
inverter.  A  general  philosophical  decision  is  to  have  low  level  devices  rela- 
tively complicated  in  order  to  facilitate  their  manipulation  and  utility  at 
higher  level  of  designs.    This  will  be  evident  in  the  design  of  the  inverter. 
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CellT  inverter! float  p<i_width,  float  pd_height,  float  pu_width, 

float  pu_height,str  compress)  inverter 

-DEFAULT  Z  ratio  of  4:1  as  per  M&C 

default  pd_width    :  =  4; 

default  pd_height  :  =  2; 

default  pu_width       :  =  2; 

default  pu_height     :=  4; 

default  compress      :=  'yes'; 

logical_equiv_class('inverter'); 

pull_up     :=  pull_up_trans(pu_width,      pu_height); 
pull-down  :=  pull_down_trans(pd_width,    pd_height); 

connect(pull_up. source,   pull_down. drain); 

in   :=  pt_alias(pull_down.gate); 
out  :=  pt_alias(pull_down.drain); 
not  :=  out; 
ioiin,    'i'); 
iotout,  'o'); 

--topological  and  geometric  information 

pull_up.source  ALIGN_HORIZONTAL&ALIGN_VERTICAL 

pull—down. drain; 
if  'fpulLup. source  DROP       pull_down. drain)  then 

errori'w',  'inverter:  bad  drop'); 
end; 

if  compress  =  'yes'  then  pulLup.source  MOVE_DOWN{}  10;  end; 
WirePath([pull_up. source,   pull_down. drain], 
'd',  pull_up.$prims[0].$plane); 


end; 


Figure  6.6  -  Inverter  cell  generator 
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The  inverter  is  specified  with  parameters  for  the  width  and  height  of 
the  pull-up  eind  the  pull-down.  A  fifth  parameter  is  compress  which  specifies 
whether  the  object  should  be  made  smaller  by  overlapping  geometry. 

The  next  grouping  of  information  provides  electrical  sind  naming  infor- 
mation for  the  inverter.  A  pull-up  and  a  pull-down  transistor  are  created 
with  the  appropriate  characteristics.  A  pull-up  has  its  drain  connected  to 
Vdd  and  its  source  connected  to  its  gate.  A  pull-down  has  its  source  con- 
nected to  Ground.  To  complete  the  inverter  electrically,  the  source  of  the 
pull-up  is  connected  to  the  drain  of  the  pull-down. 

Three  names  are  created  for  points  on  the  inverter.  In  and  out  refer  to 
the  obvious  points  on  the  underlying  transistors.  Not  is  merely  a  synonym 
for  out.  The  types  of  connections  allowed  to  the  geometry  at  that  point,  in 
terms  of  signal  type,  are  given  by  the  lo  calls. 

The  inverter  is  sufficiently  specified  to  be  simulated,  but  we  wish  to 
develop  an  initial  layout  as  a  starting  point  for  performing  the  geometric 
transformations. 

The  default  creation  of  each  primitive  is  on  a  new  plane  which  contains 
no  other  geometry.  Hence,  no  matter  what  coordinates  these  devices  axe 
created  at.  it  is  impossible  to  have  a  design  rule  violation.  While  this 
ensures  design  rule  correctness  it  is  still  desirable  to  have  a  single  plane  on 
which  a  lavout  for  the  device  exists.    This  is  part  of  providing  a  mental  pic- 
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ture  of  the  layout  status. 

Our  initial  layout   is  basically   that  found  in  Mead  and  Conway.     The 
pull-up  is  aligned  to  the  pull-down  so  that  the  source  of  the  pull-up  is  over 
the  drain  of  the  pull-down.    The  planes  are  merged  by  dropping  the  pull-up 
to  the  plane  of  the  pull-down.    The  resulting  transistor  is  taller  than  it  need 
be;  there  is  a  region  of  pull-up  source  which  can  overlap  with  the  pull-down 
drain  for  1\.    If  compress  is  'yes',  the  inverter  will  be  compressed  and  the 
transistor   regions   will   overlap.     Finally   a  physical   wire  is  created  which 
corresponds  to  the  electrical  net  formed  by  connect.    Although  this  wire  does 
not  cause  any   extra   mask  level   geometry   to  be  created   (it  is   under   the 
diffusion    area   of  two   other   transistors),    it   is   necessary    in   GENERIC    to 
ensure  that  the  connection  is  satisfied,  and  to  maintain  connectivity  if  opera- 
tors are  applied  to  the  inverter. 

One  inverter  generated  by  this  primitive  is  shown  in  figure  6.7.  Note 
that  unlike  most  layouts  for  an  inverter,  the  power  and  ground  are  not 
shown  in  metal.  The  best  location  of  and  routing  to  power  and  ground 
depend  on  the  environment  into  which  the  inverter  is  to  be  placed. 
GENERIC'S  modifiable  cells  enable  this  decision  to  be  postponed  until  the 
appropriate  time. 
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Figure  6.7  -  Inverter  produced  by  cell  generator  in  figure  6.6 
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Generators  for  nor,  nand.  and  nxor  gates  are  built  along  similar  lines. 

Unlike  other  languages  now  being  used  commercially  for  the  production 
of  cells[Nance83],  GENERIC  makes  very  little  use  of  absolute  coordinates,  or 
even  numbers.  Instead,  objects  are  placed  by  operators  using  relative  posi- 
tioning, and  distances  such  as  far,  very  far,  not  so  far  for  interdevice  spac- 
ing. 

6.2.2.    Super  Buffers 

A  more  complex  gate  type  is  a  super  buffer.  Super  buffers  are  circuits 
which  have  one  input,  and  produce  as  output  either  the  input  inverted 
(inverting  super  buffers)  or  the  input  unchanged  (non-inverting  super 
buffers).  Their  main  property  is  that  they  increase  the  speed  at  which  large 
(C)  output  loads  can  be  driven.  This  circuit  is  used  in  ratioed  logic  families, 
such  as  nMOS  where  the  speed  of  low-going  signals  is  significantly  different 
than  the  speed  of  high-going  signals.  By  "overdriving"  the  slower  of  the  two 
transitions,  approximately  equal  delays  are  attained  for  each  transition.  For 
driving  very  large  capacitances  such  as  clock  lines  or  external  pins  of  the 
chip,  these  super  buffers  are  cascaded  in  a  series  of  ever  increasing  sizes. 
Super  buffers  are  very  important  components  of  chips  since  they  aire  on  the 
critical  paths  and  since  they  consume,  on  for  example  the  MIPS  chip,  some- 
what over  1/3  the  total  power  budget  [HennesseySl]. 
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Because  of  the  need  to  drive  different  size  loads  and  because  of  the 
power  they  consume,  super  buffers  come  in  many  sizes  and  shapes.  A  prin- 
cipal goal  of  GENERIC  is  to  enable  the  generation  of  cells  which  are  as  good 
as  those  that  would  be  designed  by  hand  for  the  use  in  which  that  cell  is 
employed.  For  super  buffers,  this  means  that  the  generator  should  describe 
constraints  in  terms  of  sp>eed,  power,  size  eind  shape,  and  loading  signal. 

The  generator  must  then  generate  a  circuit  which  will  meet  these  con- 
straints if  it  is  possible  to  do  so.    Questions  that  must  be  answered  are: 

•  What  is  the  size  of  the  initial  transistor  for  the  circuit  and  does  this  cir- 
cuit drive  a  super  buffer  or  another  inverter? 

•  How  many  stages  are  required  and  what  is  the  increase  in  size  from  one 
stage  to  another? 

•  What  initial  template  should  be  chosen? 

•  How  should  transistors  be  shaped? 

•  What  is  the  aspect  ratio  of  the  inverter? 

The  circuit  description  for  a  super  buffer  is  simple,  and  is  shoxsn  in 
Mead  and  Conway.  When  computing  optimal  super  buffers,  the  amount  of 
power,  desired  rise  and  fall  times,  and  the  loads  placed  on  the  output  need  to 
be  known.  These  are  all  circuit  design  issues  and  have  been  examined 
extensively. 

The  circuit  description  consists  of  a  list  of  transistor  sizings.  The  con- 
nection of  the  transistors  is  always  the  same.  Hence,  given  the  circuit  the 
only  degree  of  freedom  if  the  layout    including  trsmsistor  shapings'. 
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The  super  buffer  generator  in  the  Flexcell  Ubrary  contains  a  prolog  to 
compute  the  circuit  to  be  produced,  and  creates  a  simulatable  circuit.  From 
this  circuit,  any  one  of  the  three  topological  templates  that  the  cell  generator 
knows  about  can  be  produced. 

A  topological  template  is  the  topological  plan  for  producing  a  sticks 
level  design.  Unlike  sticks  level  descriptions,  a  template  is  created  within 
the  generator.  Hence,  parameterized  sizes  (number  of  stages)  and  func- 
tionality (inverting  or  non-inverting)  are  possible.  Templates  represent 
plans;  they  may  be  modified  and  they  certainly  will  need  to  be  compacted. 

Three  pictorial  representations  of  the  templates  are  shown  in  figure  6.8. 
These  templates  differ  in  the  location  and  number  of  power  and  ground 
lines.  Figure  6. 8. a  shows  a  template  for  one  power  and  one  ground  line, 
6.8.b  for  one  power  and  two  ground  lines,  and  6.8.c  for  two  power  and  two 
ground  lines.  Note  that  while  the  figure  shows  a  fixed  number  of  stages,  the 
template  produces  an  arbitrary  number  of  stages. 
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(a)  single  power  and  ground  (c)  double  power  and  ground 


(b)  single  power  and  double  ground 


Figure  6.8  —  Topological  templates  for  super  buffers 
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The  GENERIC  code  for  producing  the  template  in  figure  6.8.b  is  shown 
in  appendix  A.  The  template  is  several  pages  long  and  is  rather  tedious  to 
describe.  However,  once  created,  super  buffers  m  many  varieties  can  be 
created;  see  the  layouts  in  figure  6.9. 
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Figure  6.9  -  Layouts  of  super  buffers 
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6.9  >bi 
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6.3.  Cell  manipulators  Cell  manipulators  change  the  layout  of  an  object 
and  an  initial  layout  has  been  generated.  Three  types  of  cell  manipulators 
are  described  here.  The  first  is  a  manipulation  which  uses  planes  to  effect  a 
topological  modification.  The  second  is  operator  which  removes  excess 
geometry.    Finally,  a  simple  compactor  is  shown. 

6.3.1.    Plane  manipulations 

GENERIC'S  plane  mechanism  allows  objects  to  exist  at  overlapping 
coordinates  without  creating  design  rule  violations  if  they  are  on  different 
planes.  This  property  allows  our  design  rules  to  never  be  violated,  even 
when  topological  manipulations  are  being  performed. 

An  example  of  topological  manipulation  is  shoNMi  in  figure  6.10.  sind  the 
design  specific  code  that  effected  the  manipulation  is  shown  in  figure  6  11. 
Two  nand-gates  are  shown  with  their  pull-up  drains  connected  together  and 
the  input  to  the  inverter  is  shsu-ed  with  the  input  to  the  three-input  nand 
(figure  6.10  (aj).  In  the  second  frame  (6.10  (bl).  the  three-input  nand  has 
been  placed  to  the  right  of  the  inverter  which  seems  to  cause  two  anomalous 
situations:  the  inverter  has  a  ground  diffusion  line  passing  through  its  pull- 
up  and  the  polysilicon  line  crosses  through  the  two-input  nand.  However. 
not  sisible  in  this  flat  representation  is  the  fact  that  the  poly  wires  au^e  on 
one  plane,  the  diffusion  wires  are  on  another  plane,  and  the  gates  are  on  a 
third   plane.     'Note   that   any   mapping   of  wire   segments   and   primitives   to 
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planes  is  acceptable).    Hence,  these  apparent  overlaps  do  not  really  occur. 

In  the  third  frame  (6.10  (c)),  the  two-input  nand  is  moved  to  the  left.  In 
the  fourth  frame  (6.10  (d)),  the  inverter  is  mirrored  about  the  point  where 
the  polysilicon  wire  has  been  connected  to  the  pull-doNvn  gate.  This 
effectively  changes  the  side  that  the  poly  wire  is  connected  to  the  inverter 
from  left  to  right. 

In  the  fifth  frame  (6.10  (e)),  the  leftmost  poly  wire  is  moved  right  until 
it  reaches  the  parallel  poly  wdre  and  the  bottom  most  wire  is  moved  up  until 
it  reaches  the  height  of  the  parallel  poly  wire.  Finally,  in  the  sixth  frame 
(6.10  (D),  the  inverter  is  moved  down.    The  circuit  can  now  be  compacted. 

It  is  not  our  intention  that  designers  write  design  specific  code  (as  this 
example    is)    to    perform    topological    manipulations.      Instead,    we    believe 
GENERIC'S  constructs  provide  a  fruitful  starting  ground  for  the  design  of 
topologically  modifying  cell  manipulators. 
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Figure  6.10 


-  Topological  manipulations  using  planes 
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nandl  TO_RIGHT{by:  =  L(50)}  inv; 
GrNextDrawO;  --  figure  6.10  (b) 

nand2  TO_LEFT{by:  =  L(50i}  inv; 
GrNextDrawO;  -- figure  6.10  (cl 

[xl,yl]  :=  PtXY(inv.in); 
[x2,y2]  :=  PtXY(nandl.inO); 
MIRROR{x:  =  xl,  remove:  = 'no'}  inv; 
GrNextDrawO;  -  figure  6.10  (di 

if  wires[0].$pl,$y   ^  wires[0].Sp2.$y  then 

w3  :=  wires[0];. 
else 

w3  :=  wires[l]; 
end; 

w2  IS  the  bottommost  horizontal  wire 

u'3  IS  the  leftmost  vertical  wire 
w2  MOVE_UP      (y2  -  w2.$pl.$y); 
w3  MOVE_RIGHT  (xl  -  w3.$pl.$x); 
GrNextDrawO;  --  figure  6.10  <ei 

inv  ALIGN_VERTICAL{ptl:  =  inv.in,   pt2:  =  nandl. inO}  nandl. inO; 
GrNextDrawO;  -- figure  6.10  (fi 

lisp('GdEndPlot'); 
end; 

Figure  6.11  —  Topological  manipulations  using  planes  code 


6.3.2.    Trimming 

A  heavily  used  technique  in  GENERIC  is  to  use  space  generously  in  the 
manipulation  of  objects.  Spreading  objects  far  apart  enables  the  designer  to 
perform   manipulations   \Mthout   being  concerned  about  design  rule  confiic:.- 
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with  other  objects  (other  than  the  objects  lA'iresi.  Tight  designs  are  then 
achieved  by  using  general  purpose  compaction  to  reduce  the  object  to  reason- 
able size. 

Spacing  is  one  way  of  achieving  this  technique.  Another  method,  used 
in  conjunction  with  primitive  wires,  is  to  make  the  wire  very  long  to  give 
sufficient  room  for  manipulation.  However,  since  the  wire  used  in  this  case 
is  a  primitive,  when  compaction  is  applied  this  wire  will  not  reduce  in  size. 
It  is  necessary  to  physically  trim  this  primitive,  which  can  be  effected  by 
applying  tnm  to  the  wire  in  question. 

A  skeleton  of  the  trim  code  is  shown  in  figure  6.12.  The  operator 
iterates  over  all  wire  segments  physically  connected  to  the  wire  primitive, 
and  computes  minimum  and  maximum  bounds.  The  object  is  then  physi- 
cally changed. 
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trims  a  primitive  which  consists  of  one  piece  of  geometry. 

unspec 
unary  _op 

TRIMlDirectionT  direction)  TRIM 

float  xl;  float  yl;  float  x2;  float  y2; 
int  coor; 

float[2]  interval; 
float  delta_l;  float  delta_2; 

-- forall  primitives  in  the  set  indicated  by  opl  ithe  operand  of 
—  this  unary  operator)  such  that  the  primitive  is  a  wire-prim 
V  (X  6  ObjSet(opl)  st  type(xj  =  'PrimitiveT'  & 

x.$logical_equiv_class    =  'wire_prim") 
[xl,yl,x2,y2]   :=  GeomSize(x.cgeom); 
if  (x2-xli   >  (y2-yl)  then 

coor  :=  0;  --horizontal  wire 
else 

coor  :=  1; 
end; 

interval  :  =  [°c,  —^]\ 

--forall  objects  pinned  to  the  only  geometric  rectangle 

in  the  wirc-prim  (these  must  all  be  wires',  expand  the 
interval  along  the  axis  to  ensure  that  the  the  interval 
can  hold  the  wire  endpoint. 
--p.Sobject  is  a  wire 

V  ip  €  Pinlncident(x.cgeom)  st  ps^x.cgeom) 
bbox  :  =  WireSize(p.$object); 
intervaUO]  :=  intervaUO]  MIN  bbox[coor]; 
intervaUl]  :=  interval[l]  MAX  bbox[coor  +  2]; 
end; 

--in  place  modification  of  wires.    This  is  the  low  level 

structure  munging  that  wouldn't  normally  he  done  in 
GENERIC.    However,  this  was  so  simple  I  couldn't  resL.<t' 

if  coor  =  0  then 

delta_l  :=  intervaUO'  —  cgeom.Sxl: 
delta_2  :=  intervalll'  —  cgeom.Sx2; 
x.cgeom.Sxl    -r:=  delta_l: 
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else 


x.cgeom.$x2    +:=  delta_2; 

V  [g  6  x.Sgdrs) 

g.$xl   +:=  delta_l; 
g.$x2   +:=  delt£LJ2; 
end; 

V  ic  €  x.$conn_reg) 

c.Sxl   +:=  delta_l; 
c.$x2  +:=  delta_2; 
end; 

delta_l  :=  intervaUO]   —  cgeom.Syl; 
delta_2  :=  intervaUl]   —  cgeom.Sy2; 
X. cgeom.Syl    +:=  delta_l; 
x.cgeom.$y2   +:=  delta__2; 

V  (g  6  x.Sgdrs) 

g.Syl   +:=  delta_l; 
g.$y2  +:=  delta_2; 
end; 

V  (c  i  x.$conn__reg) 

c.$yl  +:=  delta_l; 
c.$y2  +:=  delta_2; 
end; 


end; 
end, 
end; 


Figure  6.12  —  Trim  code 


121 


*k 


«t.v 


FigTare  6.13  —  Trim  example 


Note  that  trim  is  but  one  example  of  a  family  of  routines  for  removing 
excess  geometry.  Another  case  where  such  a  routine  could  be  used  is  for  a 
cell  which  is  to  be  highly  replicated.  Such  a  cell  is  typically  designed  to  con- 
nect by  abutment;  the  shift  cell  in  figure  6.14  has  a  contact  cut  necessary  to 
effect  a  layer  change  necessary  for  interior  cells.  However,  for  cells  on  the 
periphery,  this  represents  extra  geometry  and  extra  capacitance.  A  simple 
operator  can  iterate  over  the  design  and  remove  this  geometr>-. 
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Figxire  6.14  —  Shift  cell  with  extra  contact  at  periphery 

However,  one  must  keep  in  mind  the  interaction  between  design  styles 
and  operators.  Some  design  styles  (such  as  used  in  dynamic  RAMsi  would 
result  in  incorrect  circuits  if  "excess"  geometry  were  removed.  For  this  rea- 
son, the  wrap  operator  will  prohibit  this  type  of  operation. 


123 


6.3.3.    Compaction 

The  final  example  in  this  chapter  is  a  very  simple,  general  purpose  com- 
pactor. The  compactor  takes  the  minimum  object  it  can  find  (the  list  con- 
taining this  object  is  called  the  seed)  and  moves  the  seed  up  until  it  reaches 
an  object  which  interferes  with  it.  That  interfering  object,  which  is  avail- 
able in  the  global  variable  tt  is  then  added  to  the  seed  and  the  process  is 
repeated  until  every  object  in  the  critical  path  of  the  original  seed  is  com- 
pacted. The  algorithm  then  picks  the  minimum  object  remaining,  and  com- 
pacts that,  until  the  minimum  object  so  picked  has  already  been  compacted. 

The  compactor  functions  in  terms  of  three  of  one-dimensional  passes, 
and  performs  an  X-Y-X  compaction.  The  first  one  pushes  objects  up,  the 
second  pushes  objects  to  the  right  and  the  final  pass  pushes  objects  down. 

The  code  for  the  compactor  is  shown  in  appendix  B.  An  example  of  the 
first  one-dimensional  phase  of  the  compaction,  performed  on  a  superbuffer  is 
shown  in  figure  6.15.  with  each  frame  representing  another  object  added  to 
the  seed. 
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Figure  6.15  -  Compaction  of  a  super  buffer 
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In  the  phases,  we  have  prohibited  the  adding  of  kinks  and  stairs  since, 
this  te"nds  to  produce  a  lot  of  extra  kinks  and  stairs  which  are  undesirable. 
A  more  sophisticated  compactor  would  first  perform  an  X-Y-X  compaction  on 
non-kinking  and  non-stairing  wires  emd  then  produce  an  X-Y-X  compaction 
with  kinking  and  stairing.  Other  experiments  to  be  performed  would  be  for 
pattern  matching,  for  example  the  horizontal  poly  lines  (of  the  same  net) 
could  be  moved  closer  together  by  sliding  them  along  the  transistor  gates. 
While  this  would  not  make  the  circuit  any  more  compact,  it  would  reduce 
the  capacitance  on  the  poly  wire  by  shortening  its  vertical  segment.  A  more 
ambitious  transformation  would  be  to  better  distribute  components  away 
from  the  critical  path. 

These  compaction  programs  are  all  trivial  to  write  in  GENERIC. 
Unfortunately,  the  current  implementation  of  GENERIC  is  too  slow  to  allow 
extensive  experimentation. 
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CHAPTER    7 


Efficiency 


7.    Introduction 

A  CAD  system  is  useless  if  it  is  not  efficient.  The  GENERIC  program- 
ming language  implementation  is  vastly  inefficient,  and  hence  GENERIC  is 
suitable  only  for  performing  small  examples.  The  system  described  in  this 
thesis  is  composed  of  about  30,000  lines  of  source  code.  We  estimate  that  an 
efficient  implementation  would  require  about  10,000  additional  lines  of  code. 
However,  these  limitations  are  implementation  based  —  we  shall  discuss  the 
inherent  efficiency  considerations  in  GENERIC. 

GENERIC  includes  many  embedded  mechanisms  to  perform  design  rule 
safe  manipulations.  One  crucial  efficiency  question  is  how  much  does  sup- 
port for  these  mechanisms  cost?  A  second  question  involves  the  new 
mechanism  of  planes  and  its  effect  on  efficiency.  It  appears  that  plsmes  will 
increase  the  efficiency  of  manipulations  since  the  number  of  objects  to  be 
considered  is  reduced  to  only  those  which  are  relevant.  A  third  question  is 
how  the  layout  styles  at  the  Flexcell  and  cell  manipulator  level  relate  to 
efficiency.  This  is  the  hardest  question  of  all,  and  (hence)  beyond  the  scope 
of  this  thesis. 
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7.1.    Efficiency  of  movement  operators 

It  is  difficult  to  assess  the  efficiency  of  the  operators,  since  they  depend 
on  parameters  which  can  only  be  determined  accurately  experimentally. 
However,  we  shall  try  to  give  some  insight  into  the  efficiency  of  a  central 
operator,  move,  which  is  as  complex  as  any  other  operator,  and  which  is 
expected  to  be  the  most  heavily  used  operator. 

The  primary  measure  of  interest  is  the  time  it  takes  to  perform  the 
geometric  design  rule  checks  inherent  in  an  operator  given  that  we  already 
have  obtained  an  ordered  set  of  the  objects  of  interest.  Means  of  quickly 
finding  that  set  are  described  in  the  section  "mechanisms  for  increased 
efficiency". 

A  geometric  design  rule  check  of  two  geometry   rectangles   takes  con- 

» 

stant  time.  Moreover,  the  average  number  of  geometric  rectangles  compris- 
ing a  primitive  is  quite  small  and  is  almost  independent  of  size  of  design. 
(The  only  primitive  which  increases  with  size  of  design  are  transistors  driv- 
ing large  loads,  and  there  are  very  few  of  these  because  of  the  power  they 
consume.) 

Thus,  the  efficiency  depends  on  the  number  of  object-to-object  checks 
required.  We  divide  the  operation  into  steps.  During  each  step,  the  transla- 
tion set  fts)  moves  part  of  the  desired  distance  and  is  increased  in  size  by 
obstructing  wires.    Assume  that  the  objects  in  the  environment  are  all  rec- 
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tangles  and  are  sorted  in  order  of  increasing  design  rule  distance  from 
objects  in  fs  in  the  direction  of  motion.  We  define  design  rule  distance  as 
the  distance  of  the  geometry  from-  other  geometry  which  it  might  have  a 
design  rule  interaction.  For  example,  metal  has  a  design  rule  interaction 
with  other  metal,  but  not  with  diffusion.  Then,  at  worse,  every  object  in  ts 
is  checked  against  the  closest  object  in  the  environment.  Once  an  object  in 
ts  is  checked  against  the  current  object  in  the  environment,  that  object  will 
never  have  to  be  checked  again  since  it  either: 

becomes  part  of  ts ,  or 

is  passed  by  that  element  of  ts  (has  negative  distance). 

To  obtain  an  efficient  algorithm,  we  want  to  check  objects  in  ts  only 
with  those  objects  in  the  environment  which  are  close.  Let  min^distance  be 
the  time  it  takes  to  find  the  nearest  design  rule  interaction  to  an  object  in 
the  direction  of  motion.  (We  will  see  some  methods  for  computing  these 
interactions  which  have  close  to  constant  time  cost). 

The  number  of  design  rule  checks  at  each  step  is  the  size  of  the  current 
ts  icurr^s).   Hence,  the  time  for  each  step  is  0{\curr^ts\*min^distan.ce). 

Let  n  =  \ts\  +  \environment\.  Then  there  are  at  most  0{n)  steps,  each 
step  taking  at  most  OinXmin^distance).  Hence,  the  complexity  of  the 
operation  is  Oin'^Xmin^distance). 
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In  practice,  the  complexity  should  be  lower  than  this  number,  since  if 
the  ts  is  moving  to  the  right,  the  object  can  be  stratified  vertically,  so  that 
objects  which  do  not  share  any  vertical  interval  need  not  be  checked  against 
each  other.  This  would  seem  to  reduce  the  complexity  by  a  factor  of  n  (since 
the  objects  are  usually  quite  small).  It  also  simplifies  the  min^distances 
requirement. 

Thus,  we  expect  in  principle  that  these  operations  could  be  made  quite 
efficient. 

7.2.    Mechanisms  for  increased  efficiency 

In  this  section,  we  examine  several  methods  for  quickly  finding  and  ord- 
ering the  relevant  geometry.  This  geometry  becomes  the  grist  of  the  opera- 
tors. 

7.2.1.    Bounding  boxes 

One  of  the  earliest  techniques  to  increase  efficiency  is  the  use  of  bound- 
ing boxes  to  describe  a  containing  rectangle  for  the  geometry.  The  hierarch- 
ical description  of  the  chip,  which  is  even  part  of  CIF,  is  instrumented  with 
bounding  boxes.  Only  those  branches  of  the  tree  which  are  of  interest  need 
to  be  examined. 

There  are  two  problems  with  this  approach.  One  is  that  long  wires  can 
artificially  increase  the  number  of  interactions  dramatically   by  increasing 
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the  size  of  the  bounding  box  at  high  levels  in  the  tree.  Hence,  some  alterna- 
tive means  of  handling  long  wires  is  needed,  or  some  way  is  needed  of  clus- 
tering long  wires  together  in  one  small  branch  at  the  top  of  the  tree. 

Clustering  wires  together  near  the  root  of  the  tree  could  be  detrimental, 
since  the  main  purpose  of  the  hierarchy  is  not  to  decrease  computations  but 
to  allow  the  designer  a  simpler  way  of  looking  at  the  design.  These  distor- 
tions hinder  that  abstraction. 

A  second  problem  is  that  hierarchically  related  objects  should  (for 
efficiency)  be  near.  Two  small  objects  on  opposite  sides  of  the  chip  (but  in 
the  same  branch)  will  be  checked  often,  even  though  the  percentage  of  chip 
area  they  consume  is  small  since  the  bounding  box  is  large.  This  is  one  rea- 
son why  physical  hierarchy  is  often  forced  to  match  logic  hierarchy. 

7.2.2.    Quad-CIF  trees 

A  tree  of  bounding  boxes  described  in  the  previous  section  is  the  obvious 
attempt  to  attain  a  logarithmic  algorithm  but  is  dependent  on  the  shape  of 
the  data  for  recursively  partitioning  the  area  and  on  other  properties  of  the 
data  which  may  not  behave  as  expected.  An  alternate  approach  is  to  take 
the  inverse  mapping  from  sub-regions  of  the  surface  to  bounding  boxes  of 
objects.  Hence  if  a  manipulation  requires  some  operation  in  a  region,  the  set 
of  all  objects  within  that  region  are  selected.  This  is  important  since  the 
usual  question  is:  what  is  the  design  rule  interaction  of  a  given  object  with 
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the  rest  of  the  design?    Described  here  is  a  variant  called  Quad-CIF  trees 
[Kedem82]. 

The  regions  are  computed  by-  taking  the  bounding  rectangle  of  the 
design,  and  subdividing  it  into  four  rectangles  by  bisecting  the  rectangle 
horizontally  and  vertically.    This  operation  is  then  applied  recursively. 

An  object  belongs  to  a  level  of  the  tree  if  the  subdivisions  intersect  the 
object.  Since  objects  tend  to  be  quite  small  [BentlySO],  there  are  not  too 
many  objects  which  are  subdivided  at  high  levels  of  the  tree.  Moreover,  to 
maintain  the  asymptotic  performance  of  the  algorithm,  objects  which  are 
bisected  by  horizontal  (vertical)  lines  are  kept  in  a  tree  of  x  (y)  coordinates. 
Also,  in  this  representation,  long  wires  are  easily  checked  since  they  occur 
high  in  the  tree. 

Given  the  previous  discussion,  the  time  taken  to  find  an  object  in  a 
specific  region  is  Oilogn). 

7.2.3.    Magic's  tiles 

Another  representation,  invented  for  the  graphics  editor  Magic,  uses  a 
technique  called  corner  stitching  [Ousterhout84].  The  technique  divides  up 
a  plane  into  a  covering  of  disjoint  tiles.  Each  tile  contains  pointers  (at  upper 
right  and  lower  left  corners)  to  the  two  tiles  adjacent  to  each  corner,  see 
figxire  7.1. 
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Figure  7.1  —  Tiling  using  corner  stitching 


Corner  stitching  allows  very  efficient  operation  of  the  most  commonly 
needed  tasks.  These  include:  finding  all  tiles  in  a  given  regions;  finding  all 
tiles  adjacent  to  a  given  tile;  and  traversing  a  connected  region  of  tiles. 

Since  the  object  on  which  a  manipulation  to  be  performed  is  already 
pointed  at,  these  operations  become  essentially  constant  time  operations, 
and  hence  are  more  effective  than  the  logn  operations  of  Quad-CIF  trees. 
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7.3.    Use  of  existing  techniques  with  planes 

The  plane  mechanism  and  attendant  design  style  will  have  a  significant 
impact  on  the  efficiency  of  the  operators.  •  On  one  hand,  planes  enable  the 
segregation  of  relevant  data  to  an  a  segregated  space  in  which  other 
geometry  becomes  irrelevant,  and  thus  decreases  the  size  of  the  problem. 
On  the  other  hand,  the  spread-out,  compact-together  design  strategy  will 
mean  that  the  locations  of  (and  data  structures  for)  objects  are  much  more 
dynamic. 
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CHAPTER    8 


Conclusion 


8.    Conclusion 

8.1.    Contributions 

The  GENERIC  Programming  Language  is  a  tool  to  explore  VLSI  layout 
problems.  As  discussed  before,  the  real  question  is  whether  within  the  pro- 
gramming language  approach  enough  layout  tools  can  be  built  to  compen- 
sate for  the  fact  that  we  are  now  "flying  blind".  Our  instrumentation  must 
provide  sufficient  feedback  so  that  we  know  where  modifications  must  be 
made  to  the  layout.  Furthermore,  we  must  have  sufficiently  stylized  our 
response  to  these  measurements  so  that  we  are  able  to  effectively  determine 
and  apply  those  modifications  to  improve  the  layout. 

We  have  argued  that  the  irregularity  of  the  domain  makes  it  difficult  to 
solve  the  problem  purely  as  a  combinatorial  one.  We  have  also  argued  that 
graphics  systems  will  become  decreasingly  able  to  meet  the  requirements  as 
scaling  approaches  Ultra  Large  Scale  levels  of  integration. 

We  believe  that  GENERIC  is  a  step  in  the  right  direction.  GENERIC 
presents  several  novel  contributions: 
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A  new  mechanism  called  planes  which  enables  topological   modification 
while  never  violating  geometric  design  rules. 

Ability  to  modify  existing  layout  with  high  level  operators. 

A  complete  system  in  which  all  op>erations  are  design  rule  saSe  and  con- 
nectivity maintaining. 

Ability  to  mix  general  purpose  and  special  purpose  layout  tools  within 
the  same  framework. 

Integration  of  circuit  and  layout  views. 

Derivation  of  layout  from  circuit. 

Enhanced  naming  mechanisms. 

We  have  also  described  a  software  airchitecture  which  we  believe  will 
result  in  a  VLSI  layout  system  which  is  much  more  capable  than  today's 
systems. 

8.1.1.    Cell  Generator  Libraries 

The  GENERIC  Layout  System  unifies  the  concepts  of  cell  libraries,  cell 
generators  and  cell  compactors.  It  is  clear  that  cell  libraries  written  in 
GENERIC  will  be  much  more  adaptable  than  those  produced  with  such 
graphics  systems  as  Magic  since  cells  are  custom  generated  for  each  use 
instead  of  static  entities.  Cells  produced  with  GENERIC  can  be  parameter- 
ized for  structure  (ex.  number  of  inputs),  for  functionality,  and  circuit  design 
(transistor  sizing).  These  cells  can  be  modified  a  posteri  in  GENERIC 
without  accidentally  changing  the  function  of  the  cell. 

Historically,  stsmdaird  cells  and  generators  have  been  separated  because 
of  the  wavs  in  which  each  were  created.    Standard  cells  were  created  with 
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graphics  editor  while  generators  were  programs  which  produced  layout. 
Additionally,  standard  generator  technology  was  insufficient  to  produce 
irregular  designs,  which  are  required  in  every  non-trivial  VLSI  design. 

In  practice,  this  resulted  in  two  different  design  methodologies.  Stan- 
dard cells  were  the  corollary  of  standard  TTL  parts.  These  highly  optimized, 
but  fixed  cells  were  to  be  the  building  blocks  for  systems.  Generators  pro- 
duced customized  functionahty,  but  only  at  the  cost  of  highly  structured, 
regular  layouts.  The  output  of  these  generators,  typically  PLAs,  ROMS, 
RAMs  form  a  necessary  but  not  sufficient  capability  for  the  automatic  pro- 
duction of  chip-wide  layouts. 

Our  goal  is  to  provide  generators  which  are  an  order  of  magnitude  more 
flexible  than  those  ciirrently  available.  Since  cells  are  hierarchically  compo- 
sible,  such  a  system  would  be  capable  of  chip-wide  layout.  A  suitable  test 
bed  would  be  a  parallel  processor/software  development  where  changes  to 
the  software  might  result  in  changes  to  the  processor  design,  or  in  silicon 
compiler  applications. 

8.1.2.    Cell  manipulators 

In  GLS,  we  have  decoupled  those  areas  which  are  technology  dependent 
from  those  which  are  technology  independent.  We  have  discussed  a  different 
kind  of  dependence  than  is  discussed  in  other  work  —  a  qualitative  technol- 
ogy   dependence     which    represents     learned     good    design     practices     and 
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techniques  which  are  specific  to  a  technolog\'.  We  have  attempted  to  isolate 
the  technology  dep)endent  functions  in  the  cell  generator  library  and  distill 
out  the  general  purpose,  technolog>'  independent  operations  into  a  cell  mani- 
pulator library. 

We  believe  the  cell  manipulator  library  will  make  designs  more  port- 
able between  different  fabrication  facilities  for  similar  technologies,  and  will 
make  more  of  a  common  base  of  the  design  system  when  building  cell 
libraries  for  different  technologies.  Examples  of  desirable  cell  manipulators 
are: 

•  compactors 

•  topology  modifiers 

•  critical  path  reorganizers 

8.1.3.    Layout  Utilities 

Unix(tm)  [Ritchie78]  is  a  system  which  has  gained  widespread  use  as  a 
development  system  for  programmers.  Among  other  features,  the  Unix 
operating  system  provides  a  large  tool  box  of  utilities  that  perform  small 
atomic  functions  which  can  be  cascaded  to  achieve  some  more  complex  func- 
tion. While  any  individual  composition  of  tools  could  be  more  efficiently 
subsumed  by  a  hand  crafted  program,  the  environment  for  which  Unix  was 
created  is  filled  with  one-time  tasks. 
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These  tools  provide  a  very  effective  method  for  performing  these  tasks, 
and  their  modularity  provides  concise  building  blocks.  For  the  VLSI 
designer,  there  do  not  exists  similar  tools.  -The  closest  thing  designers  have 
is  automatic  compaction  tools.  However,  in  these  tools,  typically  the  input 
format  is  different  from  the  output  format.  Hence,  when  a  designer  t\ines 
his  design,  he  must  decide  what  feature  in  the  input  caused  the  problem  in 
the  output,  change  the  input  and  renin  the  compactor.  To  accomplish  this 
task  he  must  have  an  accurate  model  of  how  the  compactor  works. 

For  true  Unix  style  tools  the  following  criteria  must  be  met: 

(1)  the  input  format  must  be  the  same  as  the  output  format. 

(2)  the  input  format  must  be  reasonably  close  to  the  semantics  of  the  tool. 
In  other  words,  large  amounts  of  processing  time  cannot  be  spent  in  con- 
verting from  one  format  to  another. 

GENERIC  data  structures  satisfies  both  of  these  requirements. 
8.2.    Further  work 

8.2.1.    Efficiency 

Without  a  doubt,  the  largest  problem  in  using  GENERIC  is  its  excruci- 
atingly slow  speed.  This  problem  is  evident  on  two  levels:  the  ability  of  the 
compiler  to  produce  efficient  code  and  the  algorithmic  complexity. 

Since  the  program  is  extremely  slow  even  for  small  input,  the  most 
pressing  need  is  to  speed  up  execution  of  compiled  code.  This  will  be  done 
by  generating  C  as  a  target  language,   by  writing  a  well  tailored   garbage 
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collector  which  should  be  vastly  more  efficient  than  the  compacting  garbage 
collector  in  Franz  Lisp  [SklowerSl].  Other  major  speed  improvements  v^ill 
come  from: 

•  More  efficient  access  of  arrays 

•  Conversion  to  an  integer  coordinate  system 

•  Better  use  of  the  declared  procedure  types 

•  Control  over  the  mapping  of  the  data  into  memory 

These  changes  should  bring  about  1-2  orders  of  performance  improve- 
ment, and  will  make  it  feasible  to  build  a  VLSI  graphics  editor  to  get  faster 
feedback  on  the  development  process.  However,  the  issue  of  increased 
efficiency  of  the  compiled  code  is  not  a  research  issue,  and  is  only  of  interest 
to  the  implementors  (and  users)  of  the  language. 

A  more  far  ranging  question  is  that  of  increased  algorithmic  efficiency. 
How  can  GENERIC  be  combined  with  some  variant  of  the  efficiency  tech- 
niques discussed  in  chapter  7?  What  is  the  effect  of  the  proposed  design 
style  of  spreading  things  apart,  performing  manipulations  and  then 
compressing  them  together?  How  does  this  more  dynamic  form  of  manipula- 
tion (and  hence  more  global  changes  to  the  data  structure)  mesh  with  other 
techniques  for  performing  the  operations  themselves?  These  questions  can 
only  be  answered  after  the  system  is  made  sufficiently  efficient  that  statis- 
tics can  be  collected  over  a  large  and  varied  enough  problem  space. 
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8.2.2.  Graphics  editor 

It  would  be  desirable  to  build  a  graphics  editor  in  GENERIC.  WTiile 
this  may  seem  at  cross  purposes  with  the  goals  of  GENERIC,  it  is  necessary 
since  tool  implementors  will  need  graphics  for  the  foreseeable  future  both  for 
debugging  and  performance  tuning  of  the  layouts. 

Build  a  graphics  editor  using  GENERIC  should  be  straight  forward; 
GENERIC'S  operators  are  really  just  extremely  high  level  graphics  com- 
mands. And,  unlike  existing  editors,  a  GENERIC  graphics  editor  would  be 
extensible.  Hence  new  tools  could  be  added  as  needed.  In  fact,  we  perceive 
the  environment  as  a  split  graphics/programming  environment,  with  pro- 
gramming aspects  gaining  in  dominance  over  time. 

8.2.3.  Topological  sketches 

The  topological  templates  in  Flexcell  are  tedious  to  describe.  For  exam- 
ple, one  template  for  a  super  buffer  requires  eight  pages.  This  volume  could 
greatly  be  reduced  by  routines  which  performed  various  standard  and 
straight-forward  modifications.  Nonetheless,  without  higher  level  opera- 
tions, cells  will  remain  tedious  to  describe. 

What  is  needed  here  is  a  method  to  (unambiguously)  describe  a  tem- 
plate by,  for  at  least  most  of  the  information,  just  sketching  it  (textually,  not 
pictorially).  Hence,  the  adding  of  contacts  and  strip  routing,  performing 
non-crossing    routing,    and   relative    placement    and   orientation    need    to   be 

151 


described  much  more  succinctly,  and  performed  much  more  automatically. 

8.3.    Long  range  issues 

The  long  range  goal  is  to  demonstrate  a  silicon  compiler  system  whose 
layout  approaches  or  surpasses  that  which  is  produced  by  careful,  expert 
human  designers  using  full  custom  techniques.  To  meet  this  goal,  a  number 
of  subgoals  must  be  achieved. 

8.3.1.  Higher  level  operations 

The  operators  of  GENERIC  are  low  level.  Higher  level  operators,  based 
on  GENERIC  operators  must  be  built  which  will  enable  faster  construction 
(in  designer  time)  of  cells. 

8.3.2.  Metrics 

Since  a  programming  language  does  not  provide  a  means  to  graphically 
control  the  design  process,  we  must  find  ways  of  replacing  the  eyes  of  the 
designer  in  order  to  evaluate  and  optimize  the  layout.  The  development  of 
metrics  indicating  the  quality  of  circuit  is  crucial.  We  can  imagine  metrics 
of  the  following  flavors: 

(1)  Topological  evaluation  —  attempt  to  measure  whether  the  design  is  too 
complex  topologically,  and  hence  to  try  topological  manipulations. 

(2)  Density  measures  —  measures  whether  a  layout  is  densely  packed. 

(3)  Critical  path  evaluation  —  measures  the  effect  of  layout  on  the  critical 
paths  for  the  circuit. 
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8.4.    Final  remarks 

The  issues  raised  in  this  thesis  are  large,  and  answers  to  them  will 
require  a  system  of  several  hundred  thousand  lines  of  code.  However,  the 
development  of  effective  silicon  compilers  is  needed  if  the  potential  of  ULSI 
is  to  be  realized.    We  believe  that  GENERIC  is  a  step  in  the  right  direction. 
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APPENDIX    A 


Superbuffer  tempkite  code 


super  buffer  generation  package 


unspec 

sb_layout_three_inetal(gT  gnd_top_width, 
gT  gnd_bot_ width, 

gT  vdcLwidth)  sb-layout-three-Tnetal 

default  gnd_top_ width  :=  nMMetalWidth; 
default  gnd_bot_width  :=  nMMetalWidth; 
default  vdd_ width       :  =  nMMetalWidth; 

gT  w; 

int  count; 

CellT[]  inv_sb  :=  fi; 

PrimitiveT  VDD_metal  :=  fi; 

PrimitiveT  GND_metal_top  :=  Q; 

PrimitiveT  GND_metal_bot  :  =  12; 

CellT  x; 

CellT  prev_x; 

CellT  top_x; 

CellT  top  :=  fi; 

CellT  bottom  :=  Q; 

PrimitiveT  final  :=  12; 

gT  medium—distance; 

gT  large_distance; 

gT  huge_distance; 

int  plane; 

PinT[]  wp; 

inv_sb  :=  .inv_sb; 

medium_di5tance  :=  L(80); 
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large-distance    :=  2*inedium_distance; 
huge-distance     :=  2*large-di5tance; 

plane  :=  PlNewO; 

prmt('==  sb-layout-three_metal:  enter'); 

construct  each  subcell  with  metal  contacts 

wp  :=  PinIncidentWire(inv_sb[0].pull-U5». source. $point.$object);       .; 
if#wp  =  Othen 

WireSegDelete(wp[0].$object); 
else 

errorCf ,  'sb_layout_three_metal:   wire  missing  from  inverter'); 
end; 

V  (X  6  inv_sb) 

w  :=  nMTransWidth(x.pull-up,   'source'); 
x.pulLup  MOVE-UP  L(14); 

CellPrimOf('pu_pd_contact',  dm(w  -  2*nMCutOverhang),  x); 
PtCenter(x.pu_pd_contact)  ALIGN_HORIZONTAL 

x.pull-up. source; 
x.pu_pd_contact  T0_BEL0W{by:  =  L(6)}  x.pulLup. source; 
if  !(x.pu_pd-Contact  DROP  x.pulLdown.drain)   then 

errorCf,  'sb_layout-three_meta]:  bad  drop  (loop  1)'); 
end; 

connect(x.pull-up. source,        x.pu_pd_contact.diff); 
Wr(x.pull_up. source,        x.pu_pd_contact.diff,   w,  'd'); 

w  :=  w  MIN  nMTransWidth(x. pull-down,   'drain'); 
Wr(x.pu_pd_contact.diff,   x.pulLdown. drain,       w,  'd'); 
end; 
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priiit('==  sb_layout_three_metal:   past  first  loop'); 

count  :=  2; 

V  (X  €  inv^b[2..]) 

CellPrimOfCpulLup-contact',  pm(),  x); 
if  odd(count)  then 

PtCenter(x.pull_up_contact) 
ALIGN_VERTICAL 
x.pulLup.gate; 
x.pulLup-contact  TO_XEFT&DROP  x.pulLup.gate; 
coiinect{x.pulLup_contact.poly,    x.pulLup.gate); 
Wr(x.pulLup_contact.poly,   x.pulLup.gate, 
nMPoly Width,  'p'); 
else 

PtCenter(x.pulLup_contact) 
ALIGNLVERTICAL 
x.pulLup.gate2; 
x.pulLup_contact  TO_RIGHT&DROP  x.pulLup.gate2; 
connect(x.pulLup_contact.poly,    x.pulLup.gate2); 
Wr(x.pulLup_contact.poly,   x.pulLup.gate2, 
nMPolyWidth,  'p'); 
end; 

count  +:  =  1; 
end; 

prmt('==  sb_layout_three_metal:  past  second  loop"); 
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put  in  VDD  and  GROUND  lines 


VDD_metal        :=  wprimCm',  huge_distance,  vdd_ width); 
GND_inetaLtop  :=  wprimCm',  huge_distance,  gnd_top_ width); 
GND_metaLbot  :=  wprimCm',  huge_distance,  gnd_bot_ width); 
VDD_metal       TO^BOVE  large_distance; 
GND_metaLtop  TO_ABOVE  2*large_distance; 
VDD_metal        DROP  plane; 
GND_metaLtop  DROP  plane; 
GND_inetal_bot  DROP  plane; 

connect(VDD_metal.cgeoin,  Vdd); 
connect(GND_metal_bot.cgeom,   Gnd); 
connect(GND_metal_top.cgeom,  Gnd); 

prmt('==  sb_layout_three_metal:  past  VDD  and  GROUND  lines'); 
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First  configure  the  top  cells 

X  :=  inv^b[l]; 

ORIENT{180}  x; 

X  T0^B0VE{by:  =  large_distance/8}    VDD_metal; 

X  TO_RIGHT{pt2:  =  PtLL(GND_metaLtop)}   GND_metaLtop; 

X  DROP  plane; 

for  (count  :=  3,5..#inv_sb) 

X  :=  inv_sb[count]; 

prev_x  :=    inv_3b[count-2]; 

ORIENT{180}  x; 

X  TO_RIGHT{by:  =  large_distance}   prev_x; 

x  ALIGN_VERTICAL{ptl  :=  PtCenterfx.pull_up_contact), 
pt2  :=  PtCenter(prev_x.pu_pd_contact)} 
prev_x; 

X  DROP  plane; 

connect(x.pull_up_contact. metal,   prev_x.pu_pd_contact. metal); 
Wri  x.pulLup-Contact. metal, 

prev_x.pu_pd_contact. metal, 

nMMetalWidth,  'm'j; 
end; 

prmt('==  sb_layout_three_metal:  past  third  loop'); 
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Then  configure  the  bottom  cells 

for  (count  :=  0,2..#inv_sbi 

X  :=  inv_sb[count]; 

prev_x  :=    inv^b[count  —  2]; 

top_x    :=    inv_5b[count  +  l]; 

if  count +  1  ^  #inv_sb  then 

PtCenter(x)  ALIGN_HORIZONTAL  PtCenter(top_x); 
else 

X  TO_RIGHT{by:  =  large_distance}  prev_x; 
end; 

if  count  ^  2  then 

X  ALIGN_VERTICAL  {ptl:  =  PtCenter(x.pull_up_contact), 
pt2:  =  PtCenter(prev_x.pu_pd_contact )} 
prev_x; 

X  DROP  plane; 

connect(x.pull_up_contact. metal, 

prev_x.pu_pd_contact. metal); 
Wr(x.pull_up_contact. metal, 
prev_x.pu_pd_contact. metal, 
nMMetalWidth,  'm'); 
else 

X  TO_BELOW{by:=   large_distance/4}  VDD_metal; 
X  DROP  plane; 
end; 
end; 

GrNextDrawO; 

print('==  sb_layout_three_metaI:   past  fourth  loop'); 
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basic  cells  placed,  now  rivet  down  to  VDD  and  GROUND 

for  (count  :=  0..#inv_sb) 
X  :=  inv^b[count]; 

gnd_contact_width  :=  nMTransWidth(x.pull_down,   'source'); 

gnd_contact  :=  dm(gnd_con.tact_width   -  2*iiMCutOverhang); 
CellPrimOn'gnd-contact',  gnd_contact,  x); 
connect(gnd_contact. metal,   Gnd); 

PtCenter(gnd_contact)  ALIGN_HORIZONTAL 

PtCenter(x.pull_down); 

if  (count  MOD  2)  =  0  then 

PtCenter(gnd_contact) 

ALIGN_VERTICAL 
PtCenter(GND_metal_bot); 

gnd_contact  DROP  plane; 

Wr(gnd_contact. metal,  GND_metal_bot.cgeom, 
gnd_contact_width,  'm'); 

vdd_contact_ width  :=  nMTr£insWidth(x.pull_up,  'drain'); 
vdd_wire_width  :=  vdd_contact_width; 
if  count  <  #inv_5b  then 

vdd_contact_ width  :=  vdd_contact_ width  MAX 
nMTransWidth( 
inv_sb[count  +  l].pull_up, 
'source'); 
end; 

vdd_contact  :=  dm(vdd_contact_width-2*nMCutOverhang); 
CellPrimOfCvdd-Contact',  vdd_contact,  x); 
connect(vdcLcontact. metal,  Vdd); 
PtCenter(vdd_contact) 

ALIGN_VERTICAL 

PtCenter(VDD_metal); 
if  count  <  #inv_sb  then 

PtCenter(vdd_contact) 
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ALIGN_HORIZONTAL 

inv_sb[count  +  l].pull_up. drain; 
else 

PtCenter(vdd_contact) 

ALIGN_HORIZONTAL 

x.pulLup.drain; 
end; 
vdd_contact  DROP  plane; 

Wr(vdd_contact. metal,  VDD_metal.cgeom, 

vdd_ war e_ width,  'm'); 
else 

PtCenter(gnd_contact) 

ALIGN.VERTICAL 

PtCenter(GND_metaLtop); 
gnd_contact  DROP  plane; 

connect(gnd_contact. metal,  GND_metal_top.cgeom); 
Wr(gnd_contact. metal,  GND_metal_top.cgeom, 
gnd_contact_width,  'm'); 

vdd_wire_width  :=  nMTransWidth(x.pull_up,   'drain'); 
end; 

Wr(x.pulLdown. source,   gndLcontact.diff, 
gnd_contact_width,  'd'); 


Wr(x.pulLup. drain,   vdd_contact.diff,  vdd_wire_width,  'd'); 

end; 

GrNextDrawO; 

print('==  sb_Jayout_three_metal:   past  fifth  loop'); 

compute  pull  up  contacts 

contacted_to  :=  [fi,  fil; 
for 'count  :=  2..#inv_sb- 1) 
if  odd'count)  then 

contacted—to  +:  = 

[inv_sb[count  +  Ij.pulLup.gate]; 
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else 

contacted_to  +:  = 

[inv_§b[count+  l].pull_up.gate]; 
end; 
end; 

if  #inv^b  >  2  then 

contacted  :=  inv_sb[#inv_sb  — 1]; 
final  :=  pmO; 

CellPrimOfCfinal',  final,  );    . 
PtCenter(final)  ALIGN_VERTICAL 

PtCenter(contacted.pu_p<l_contact); 
final  TO_RIGHT[by:  =  5*large_distance/8}   contacted.pulLup; 
final  DROP  plane; 
contacted_to  +:=  [final.poly]; 

connect(final. metal,  contacted.pu_pd_contact.metal); 
Wr(final. metal,  contacted.pu_pd_contact.metal, 

nMCutOverhang  +  nMCutWidth, 

'm',  plane:  =  plane); 
end; 

print('==  sb_layout_three_metal:  pull_up  contacts  done'); 
GrNextDrawO; 
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finally,  wire  up  the  stages 


pt_from  :=  mv_sb[0].pull_dow'n.gate2; 

pt_to     :=  XYMove(PtXY(pt_from),  EAST,  3*large_distance/8); 
wl  :=  WireNew(pt_from,  pt_to,  'p',  plane:  =  plane); 
Wr(wl.$p2,  inv_sb[l].pull_up.gate,   plane:  =  plane); 

pt_from  :=  inv_sb[l].puIl_down.gate2; 

pt_to     :=  XYMove(PtXY(pt_from),  WEST,  3*large_distance/8); 
wl  :=  WireNew(pt_from,  pt_to,  'p',  plane:  =  plane); 
Wr(wl.$p2,  inv_sb[0].pull_up.gate,   plane:  =  plane); 


Connect  all  the  bottom  pull-up  gate2  to  the  top 
pull-down  gate 

for  (count  :  =  2,4..#inv_sb) 

bottom  :=  inv_sb[count]; 

pt_from  :=  bottom. pull_dow'n.gate2; 

pt_to     :=  XYMove(PtXY(pt_from),  EAST,  3*large_distance/8); 

wl  :=  WireNew(pt_from,  pt_to,  'p',  plane); 

Wr(wl.$p2,  contacted_to[count],  plane:  =  plane); 
end; 

for  (count  :=  3,5..#inv_sb) 
top  :=  mv_sb[count]; 

--leftside  connection 
pt_from  :=  top. pulLdown. gate; 

pt_to     :=  XYMove(PtXY(pt_from),  EAST,  5*large_distance'8); 
wl  :=  WireNew(pt_from,  pt_to,  'p',  plane); 
Wr(contacted_to(count],  wl.$p2,  plane:  =  plane); 
end; 

GrNextDraw(); 

printr==  sb_layout_three_metal:  done'); 
end; 
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APPENDIX    B 


Compact  Code 


compact  cells 


unspec 

Compact(PlaneElmtsT[]  objs,  Direction!  d)  Compact 

PlaneElmtsT[]  seed; 
PlaneElmtsT[]  compacted; 
gT  distance;  --  maximum  compaction  distance 

MarkAlKobjs,  COMPACT_FLAG); 

WireCreateFlag  :=  WireCreateFlag  OR  COMPACT_FLAG; 

[distance,  seed]  :=  CompactDirection(objs,  d); 

compact  until  minimum  object  has  been  compacted 
loop 
in  it 

compacted  :=  []; 
while 

([distance,  seed]  :=  CompactDirection(objs,  d))  & 

(3x6  compacted  st  x  =  seed[0]) 


do 


loop 

while  #seed3i#objs 

do 

distance  -:=  seed  MOVE{d}  distance; 

print('###  Compact:  distance  =  ',  distance); 
GrNextDraw(objs); 


if  distance  <  0  then 

print!' -I-+  Compact;  maximum  distance  moved   ; 
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break; 
end; 

if  pi  =  []  then 

print! '+  -t-  Compact^  pi  empty'); 

brealc; 
end; 

if  3  X  6  pi  St  !set(x.$flags,  COMP;^CT_FLAG)  then 
printf('++  Compact:  object  outside'); 
printCcompaction  set  x  =  ',  x); 
break; 

end; 

seed  :=  ts  +  pi; 
end; 
V  (x  €  seed  st  3  y  6  compacted  st  x  =  y) 

compacted  +:=  [x]; 
end; 
end; 


print('++  Compact:  finished  loop'); 

V  (plane  6  PIAIK)) 

UnMarkAll(PlGet(planel,  COMPACT_FLAG); 
end; 

WireCreateFlag  :=  WireCreateFlag  AND  (NOT  COMPACT_FLAG); 
end; 
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unspec 

CompactXYX(unspec  cell)  CompactXYX 

PlaneElintsTl]  objs; 

objs  :=  ObjSet(cell); 

CompactCobjs,  UP); 
CompactCobjs,  RIGHT); 
Compact(objs,  DOWN); 
end; 

unspec 

CompactDirection(PlaneElmtsT[]  objs,  DirectionT  d)  CompactDirection 

int  i; 

gT  distance; 
BboxT  bbox; 

if  d  =  UP  then 

i  :=  CompactDirectionSeed(objs,   1,   1); 
elseif  d  =  DOWN  then 

i  :=  CompactDirectionSeedlobjs,  3,  —1); 
elseif  d  =  RIGHT  then 

i  :=  CompactDirectionSeed(objs,  0,    1); 
elseif  d  =  LEFT  then 

i  :=  CompactDirectionSeed(objs,  2,  -1); 
end; 

bbox  :=  BbObject(objs); 

if  (d  =  NORTH)  |  (d  =  SOUTH)  then 

distance  :=  bbox[3]-bbox[l]; 
else 

distance  :=  bbox[2]-bbox[0]; 
end; 

return  [distance,  [objs[i]]]; 
end; 

int 

CompactDirectionSeed(PlaneElmtsT[]  objs,  int  coor, 

int  sign)  CompactDirectionSeed 

int  i; 

int  best-i; 

PlaneElmtsT  best_obj  :=  fi; 
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BboxT  bbox; 
gT  best; 

best-J  :=  i  :=  0; 

bbox  :=  BbObject(objs[0]); 

best  :=  bbox[coor]; 

V  (x  6  objs) 

bbox  :=  BbObject(x); 
if  (bbox[coor]*sign)    <  (best*sign)  then 
best_i  :=  i; 
best     :=  bbox[coor]; 
elseif  (bbox[coor]*sigTi)   =  (best*sign)  then 
if  type(x)  =  'PrimitiveT'  then 
best_i  :=  i; 
best     :=  bbox[coor]; 
elseif  type(objs[best_i])   *  'PrimitiveT'  & 
WireAxis(x)   ^  orientation(direction) 
then 

best_i  :=  i; 
best     :=  bbox[coor]; 
end; 
end; 
i  +:=  1; 
end; 

return  best_i; 
end; 
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